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(57) A modular optical switch includes a set of opti- 
cal switch modules connected in a mesh, a master con- 
troller for the whole optical node and a switch-module 
controller for each of the optical switch modules. The 
optical switch modules receive optical signals from, and 
transmit optical signals to, edge nodes based on con- 
nection requests received from the edge nodes. The 
master controller acts to select a path, using a simple 
or compound time-slot matching process, through the 



mesh of switch modules for each optical signal related 
to a connection request. Advantageously, the optical 
switch modules are fast switching, enabling the use of 
time-sharing schemes such as TDM, and the modular 
optical core node is made practical by efficient path se- 
lection at the master controller. A hybrid modular switch 
may Include both optical and electronic switch modules, 
a master controller, and a switch-module controller for 
each of the switch modules. 
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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention relates to communication networks and, more particularly, to a modular, high-capacity 
switch for use In such networks. 

BACKGROUND 

10 [0002] Atypical data network comprises a number of source nodes, each source node receiving traffic from numerous 
traffic sources, and a number of sink nodes, each sink node transmitting data to numerous traffic sinks. The source 
nodes can be connected to the sink nodes directly or through core nodes. Source nodes and sink nodes are usually 
paired so that a source node and an associated sink node are included within a single edge node. 
[0003] The capacity of a given data network may be defined by the capacities of the edge nodes and the core nodes. 
IS Each link between nodes (e.g., a link from a source node to a core node or a link from a core node to a sink node) 
may comprise multiple channels. Where an optical link is divided into channels using Wavelength Division Multiplexing 
(WDM), each channel in the optical link is defined by a separate modulated wavelength. Additionally, the link, or chan- 
nels of the link, may be sub-divided in time using Time Division Multiplexing (TDM). In a TDM scheme, a set number 
of time slots makes up a TDM frame. A core node that acts to connect source nodes to sink nodes using multi-channel 
20 links may switch an entire incoming link to art outgoing link (link switching), an incoming channel to an outgoing channel 
(channel switching) or an incoming sub-divided channel to an outgoing sub-divided channel (time-sharing switching). 
In time-sharing switching, a received optical signal may be switched to a first sink node during one time interval and 
switched to a second sink node during an immediately subsequent time inten/al. Accordingly, a switch must be capable 
of changing the destination of a received signal in a very short period of time. Switches capable of such switching are 
25 said to be capable of fast-switching. 

[0004] Fast-switching, high-capacity, optical switches are needed to realize an agile optical-core network, that is, an 
optical-core network that may adjust swiftly to changes in desired connectivity between edge nodes. A core node 
comprises one or more switches. The degree of a switch (or a core node) is a measure of a number of input ports and 
output ports. To construct a high degree, fast-switching, optical core node using optical-switch modules of smaller 
30 sizes, It is known to use an aggregation of the optical-switch modules in a multi-stage arrangement. Where a typical 
space switch need only be configured to connect an input port to an output port, a core node comprising several optical- 
switch modules may need to be configured to connect an input port in a first switch to an output port of a second switch. 
Such a need may be met by establishing a "path" across the core node from the first stage to the second stage. If a 
direct connection from the first stage to the second stage is unavailable, such a path may traverse one or more inter- 
ns mediate stages. 

[0005] The construction of a high-capacity, multi-stage, optical core node using optical-switch modules would be 
considered easily manageable if high path-establishment speed was not a requirement. For instance, high path-es- 
tablishment speed is not a requirement in the case of conventional cross connectors. However, a relatively long delay 
required to change a path across an optical core node may preclude the use of such an optical core node for time- 

40 sharing switching schemes such as TDM switching or burst switching. In the absence of such time-sharing switching 
schemes, the multi-stage optical core node becomes a channel-switching cross-connector and a network based on 
such a core node may be forced to perform such measures as multiple edge-to-edge hops and/or intra-core hops to 
interconnect certain edge nodes. These measures may significantly increase the complexity, and degrade the perform- 
ance, of a network. Ideally, every source node in a given network should be able to reach every sink node in the given 

45 network either directly or via a single core node. However, when none of the core nodes to which a source node 
connects subsequently connects to a desired sink node, there may be a necessity to send a data stream to an inter- 
mediate edge node that connects to a core node thai does connect to the desired sink node. Such use of an intemaediate 
node may be called "tandem switching". 

[0006] The number of sink nodes that a given source node can reach without switching at an intermediate edge node 
so is referenced herein as the reach index of the source node. Slow switching, in contrast to the fast switching mentioned 
above, may limit the reach of a source node and may necessitate tandem switching for data streams of low traffic 
intensity. This is due to the coarse granularity resulting from slow switching, which increases the number of hops. A 
data stream is defined as data that is transfeaed from a source node to a sink node over a partfcular path (e.g.. via a 
particular core node). Tandem switching may be required when channel switching is used in the core because the 
ss number of channels emanating from a source node would typically be smaller than the number of sink nodes address- 
able by the source node. 

[0007] Hereinafter, a link from a source node to a core node is called an uplink and a link from a core node to a sink 
node is called a downlink. A channel in an uplink is called an upstream channel and a channel in a downlink is called 
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a downstream channel. Data carried by an uplink or an upstream channel is identified as upstream data and data 
carried by a downlink or a downstream channel is identified as downstream data 

[0008] An electronic edge node can realize a high reach index due to the inherent fast-switching ability of electronic 
edge nodes, which enables capacity division into small units. As stated above, the reach Index of a reference edge 
node is the number of other edge nodes that can be reached by the reference edge node, directly or through core 
nodes. An optical core node, however, is preferably bufferless and, hence, requires precise time coordination with the 
source nodes in order to create paths of fine granularities through time sharing. To summarize, a fast-switching core 
node may independently route individual data blocks defined using time-sharing switching techniques such as TDM 
switching or burst switching. Realizing a high-capacity network requires high -capacity, fast-switching core nodes. Pref- 
erably, the core nodes include optical switches. 

[0009] McGuire(U.S. Patent No. 5,889,600, issued March 30, 1999) discloses a modular switch operated in a channel 
switching mode comprising a plurality of star couplers, connecting to a plurality of input WDM links and a plurality of 
output WDM links. Each WDM link comprises a number of wavelength channels equal to the number of star couplers. 
Each input WDM link is demultiplexed into its constituent wavelength channels and each of the individual wavelength 
channels connects to an input port of one of the star couplers. Wavelength converters are provided at the output ports 
of the star couplers. Each output WDM link carries multiplexed optical signals received from an output port of each 
star coupler. The modular switch allows a wavelength channel from any input port to connect to any of wavelength 
channel in a subset of the output ports of the star couplers. For example, using 32x32 star couplers, 32 WDM input 
links and 32 WDM output links, each input link and each output link carrying 32 wavelength channels, a specific wave- 
length channel in an input link can be switched to any one of a subset of 32 of the 1 ,024 output ports of the 32 star 
couplers. 

[0010] Multi-stage, optical switch structures that switch channels are known in the prior art. For example, Kuroyanagi 
(US Patent No. 6,154,583, issued November 28, 2000) describes an optical switch configured as a multi-stage circuit, 
with each of the stages including a plurality of space switches. An arrangement of optical amplifiers is also described. 
Such structures, however, are limited to channel switching granularity, which may be considered too coarse for future 
applications. 

[0011] Bala et al. (US Patent No. 6,335,992, issued January 1 , 2002) describe a scalable multi-stage optical cross- 
connect. The multi-stage optical cross connect comprises a plurality of first stage switch matrices, a plurality of middle 
stage switch matrices having input ports and output ports, and a plurality of last stage switch matrices having input 
ports and output ports. Each of the first stage switch matrices has a plurality of input ports, each input port receiving 
an input communication signal, and a larger number of output ports, where the first stage switch matrices switch the 
input communication signals to selected output ports. The input ports of the middle stage switch matrices are coupled 
to the output ports of the first stage switch matrices for receiving communication signals output from the first stage 
switch matrices. The middle stage switch matrices switch communications signals received at their input ports to their 
output ports. The input ports of the last stage switch matrices are coupled to the output ports of the middle stage switch 
matrices for receiving communication signals output from the middle stage switch matrices. The last stage switch 
matrices switch communications signals received at their input ports to their output ports. In addition, the middle stage 
itself can be recursively a multistage switch. 

[001 2] Neither of the above two disclosures suggests the use of a time-sharing scheme, such as TDM, in a bufferiess 
multi-stage switching node. A node structure that permits scalability and can employ time-sharing techniques is re- 
quired, and methods of circumventing the difficulty of scheduling signal transfer in bufferless, multi-stage, time-sharing, 
optical switching nodes are required to enable the realization of such nodes and, ultimately, an efficient network that 
scales to capacities of the order of several petabits/second. 

SUMMARY OF THE INVENTION 

[001 3] A modular, high-capacity, optical core node, which is exemplary of the modular, high-capacity switch proposed 
herein, includes several medium-capacity, fast-switching, optical switch modules enables the construction of an agile, 
scalable optical network. Each optical switch module is provided with a controller that includes time-locking circuitry 
and adaptive-configuration circuitry. The optical switch modules are connected to each other in a mesh structure and 
each uplink has an internal time-switched path to a respective switch-module controller. The switch-module controllers 
are communicatively coupled to a master controller, which hosts a fast path-selection device. The switch-module con- 
trollers also exchange time-locking data with external nodes, receive connection requests from external nodes, com- 
municate the time-slot-allocation requests to the master controller (based on the connection requests), receive de- 
scriptions of time division multiplexed frames from the master controller and configure the internal connectivity of the 
optical switch modules based on the appropriate time division multiplexed frame descriptions. 
[0014] Advantageously, the master controller of the modular, high-capacity switch is configured so that time-slot- 
allocation requests that may be satisfied by a second-order time-slot matching process (for a direct path between 
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switch modules) may be processed before time-slot-allocation requests that may be satisfied by a third-order time-slot 
matching process (for an indirect path between switch modules). This order of processing avoids unnecessary diversion 
of a connection to an indirect path. Intra-switch-module paths, which do not require traversing inter-module links, can 
be processed last. 

5 [0015] In accordance with a first aspect of the present invention, there is provided a modular switch comprising 
a plurality of switch modules, each of said plurality of switch modules including: 

a plurality of inlet ports, each of said plurality of inlet ports adapted to communicatively couple to an optical link 
10 from an external node; 

a plurality of outlet ports, each of said plurality of outlet ports adapted to communicatively couple to an optical link 
to an external node; 

IS 3L plurality of outbound ports, each of said plurality of outbound ports communicatively coupled to an inter-modular 

link to an other of said plurality of switch modules; 

a plurality of inbound ports, each of said plurality of Inbound ports communicatively coupled to an inter-modular 
link from an other of said plurality of switch modules; and 

20 

a master controller including a path selection device adapted to match vacant time slots in calendars associated 
with particular ones of said inlet ports^ said outlet ports and selected outbound ports of said switch modules in 
response to receiving a connection request said connection request specifying an Inlet port, an outlet port, and a 
number of time slots in a time-division-multipiex frame. 

25 

[0016] In accordance with a second aspect of the present invention, there is provided a method of controlling an 
optical switch module in a modular optical switch, said method comprising: 

receiving a connection request, from an edge node, via said optical switch module, where satisfying said connection 
30 request requires use of an inter-modular link to another optical switch module in said modular optical switch; 

sending a time-slot-altocatlon request to a master controller of said modular optical switch, where said time-slot- 
allocation request is based at least in part on said connection request; 

35 receiving a plurality of connection schedules, from said master controller in response to said sending said time- 

slot-allocation request, wherein each of said plurality of connection schedules is associated with a port of said 
optical switch module; and 

sending, to said optical switch module, commands adapted to configure an internal connectivity of said optical 
40 switch module according to said connection schedules. 

[0017] In accordance with a third aspect of the present invention, there is provided a modular switch comprising: 

a plurality of switch modules, each of said plurality of switch modules including: 

45 

a plurality of inlet port means for communicatively coupling to an optical link from an external node; 

a plurality of outlet port means for communicatively coupling to an optical link to an external node; 

so a plurality of outbound port means for communicatively coupling to an inter-modular link to an other of said 

plurality of switch modules; 

a plurality of inbound port means for communicatively coupling to an inter-modular link from an other of said 
plurality of switch modules; and 



55 



a master controller including a means for matching vacant time slots in calendars associated with particular 
ones of said Inlet port means, said outlet port means and selected outbound port means of said switch modules 
in response to receiving a connection request, said connection request specifying an Inlet port means, an 
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outlet port means, and a number of time slots in a time-division-multiplex frame. 
PREFERRED EMBODIMENTS OF THE INVENTION 

5 [0018] According to a preferred embodiment of the present invention, there is provided an optical core node. The 
optical core node includes a plurality of optica! switch modules connected as a mesh, wherein each of the optical switch 
modules is adapted to communicate with an external node over a corresponding outer link, communicate with another 
optical switch module over an inner link. The optical core node also includes a plurality of switch-module controllers, 
each of the plurality of switch-module controllers associated with one of the plurality of optical switch modules, each 

10 of the switch-module controllers including a time-locking unit adapted to time-lock the switch-module controller with 
the external node with which the switch-module controller communicates over the outer link. 

[0019] According to a further preferred embodiment of the present invention, there is provided, in a switching node 
having a plurality of input ports and a plurality of output ports, the switching node having a controller that includes a 
route-set memory storing a route set of routes for each pair of input and output ports, and a plurality of cascaded 

15 schedulers, each scheduler associated with a result memory and operable to schedule connections for a specified 
sub-set of time slots within a slotted time frame, a method of scheduling connections in response to receiving connection 
requests. Each connection request includes a connection descriptor having a connection identifier and specifying one 
of the plurality of input ports, one of the plurality of output ports and a requested number of time slots in a slotted time 
frame. The method includes determining a number of pending time slots, the number of pending time slots initially 

20 equated to the requested number of time slots, selecting a cun-ent scheduler, starting with the first scheduler, to allocate 
allocable time slots and place identifiers of the allocable time slots in the result memory, selecting a subsequent sched- 
uler as the current scheduler and cyclically reading content of the result memory associated with each of the plurality 
of cascaded schedulers. 

[0020] In preferred features of this method, routes in each route set are classified into at least one category such 

25 that the allocable time slots may only be allocated within a selected one of the at least one categories. Further, the at 
least one categories may be sorted according to a descending order of preference, a first sorted category being the 
most preferred, and the use of routes may be allowed in a preferred category for allocating the allocable number time 
slots, where the pending number of time slots is not zero and the current scheduler is the last of the plurality of cascaded 
schedulers, repeating the selecting the schedulers allowing use of routes in a next preferred category of the at least 

30 one categories. Notably, the switching node may be a space switching node including switching modules an-anged in 
a meshed structure and the schedulers may be time-slot matching units. The at least one categories may include: 
routes traversing two of the switching modules; routes traversing three of the switching modules; and routes that 
traverse one of the switching modules. The method may also include releasing the allocable time slots if the pending 
number of pending time slots is still greater than zero after allowing use of all of the at least one categories; and 

35 generating a rejection message. The method may also include reporting the identifiers of the allocable time slots for 
the connection request to an entity from which the connection request was received. Additionally, if the pending number 
of time slots is greater than zero after using all of the categories, the identifiers of the allocable time slots may be 
retained and a new connection request may be initiated specifying the pending number of time slots. Remarkably, 
routes of the same category in the route set for each pair of input and output ports may be selected cyclically for 

40 consecutive connection requests associated with the each pair of input and output ports, allocating the allocable time 
slots may include selecting the allocable time slots to be as close as possible to a reference time slot in the slotted 
time frame and a size of the specified sub-set of time slots may be adapted so as to balance processing loads of the 
plurality of cascaded schedulers. 

[0021] According to a further preferred embodiment of the present Invention, there is provided, in a switching node 
45 comprising inlet ports, outlet ports and inner links, where a route set is designated for each pair of inlet and outlet ports 
and includes at least one route, each of the at least one route traversing two of the inner links, each inlet port adapted 
to receive time-multiplexed signals, each signal occupying at least one time-slot in a time frame having o time slots, 
each inlet port, outlet port and inner link associated with a calendar of a > 1 cells, each cell corresponding to a time 
slot and containing an indication of an occupancy state, a method of scheduling a transfer of data, in a specified number 
50 of time slots, from a given inlet port to a given outlet port using a designated route set. The method including selecting 
a candidate time slot from the a time slots, where the selecting is performed in a predetermined order, and, for the 
candidate time slot, determining, from the calendar associated with the given inlet port, an occupancy state of the given 
inlet port for the candidate time slot, determining, from the calendar associated with the given outlet port, an occupancy 
state of the given outlet port for the candidate time slot, selecting a candidate route in the designated route set for 
55 consideration, where the selecting is performed in a cyclic order and where the consideration comprises determining, 
from the calendar associated with a first inner link in the candidate route, an occupancy state of the first inner link in 
the candidate route for the candidate time slot and, where the occupancy state of the given inlet port, the given outlet 
port and the first inner link in the candidate route is detemnined as vacant, considering the candidate route a first 
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available route and the candidate time slot an allocable time slot. 

[0022] In preferred features of this method, a vacancy value may be associated with each of the candidate routes, 
where a vacancy value for a given route is a total number of vacant time slots per time frame. The occupancy state 
determining and candidate route selecting may be repeated to find a second available route from the candidate routes 
and the second available route selected If the vacancy value associated with the second available route is greater than 
the vacancy value associated with the first available route. Additionally, the candidate time slot selecting, the occupancy 
state determining and candidate route selecting may be repeated while the specified number of time slots exceeds the 
number of allocable time slots. Notably, the predetermined order may be a sequential order of the s lime slots. 
[0023J According to a further preferred embodiment of the present invention, there is provided a method of matching 
vacant time slots in a plurality of calendars of time slots, each of the plurality of calendars associated with a port at 
one of a plurality of optical switch modules in a modular optical switch. The method includes receiving a request for 
scheduling a time slot that is vacant in a first of the optical switch modules and a second of the optical switch modules, 
the request specifying an inlet port of the first of the optical switch modules, used to receive a data block from a first 
edge node, and an outlet port of the second of the optical switch modules, used to transmit the data block to a second 
edge node and performing a second-order matching process. The second-order matching process includes examining 
occupancy of the inlet port and the outlet port over sequential time slots until a particular time slot is found for which 
both the inlet port and the outlet port are vacant, where the examining begins at an initial time slot and. where the 
particular time slot is found for which both the inlet port and the outlet port are vacant, assessing occupancy of a first 
inter-modular link connecting the first of the optical switch modules to the second of the optical switch modules for the 
particular time slot. 

[0024] In preferred features of this method, where the first inter-modular link is vacant for the particular time slot, the 
plurality of calendars may be updated to result in updated calendars that reflect use of the particular time slot to satisfy 
the request. Upon completion of the second-order matching process, the updated calendars may be sent to the first 
of the optical switch modules and the second of the optical switch modules. Notably, where the first inter-modular link 
is occupied for the particular time slot, the examining and. the assessing may be continued, starting at the time slot 
subsequent to the particular time slot. The second-order matching process may be limited to a predefined range of 
time slots and wherein, if the particular time slot is not found, the request is fonA/arded for a second-order matching 
process in respect of a subsequent predefined range of time slots. The method may include a third-order matching 
process, the third-order matching process including examining occupancy of the inlet port and the outlet port over 
sequential time slots until a new particular time slot is found for which both the inlet port and the outlet port are vacant, 
where the examining begins at the initial time slot, where the new particular time slot is found for which both the inlet 
port and the outlet port are vacant, assessing occupancy of a second inter-moduiar link connecting the first of the 
optica! switch modules to a third of the optical switch modules and a third inter-moduiar link connecting the third of the 
optical switch modules to the second of the optical switch modules for the new particular time slot. Remarkably, where 
both the second inter-modular ink and the third inter-modular link are vacant for the particular time slot, the plurality of 
calendars may be updated to result in updated calendars that reflect use of the new particular time slot to satisfy the 
request. Additionally, upon completion of the third-order matching process, the updated calendars may be sent to the 
first of the optical switch modules, the second of the optical switch modules and the third of the optical switch modules. 
[0025] According to a further prefen-ed embodiment of the present invention, there is provided a method of matching 
vacant time slots in a plurality of calendars of time slots., each of the plurality of calendars associated with a port at 
one of a plurality of optical switch modules in a modular optical switch. The method includes receiving a request for 
scheduling a time slot that is vacant in a first of the optical switch modules and a second of the optical switch modules, 
the request specifying an inlet port of the first of the optical switch modules, used to receive a data block from a first 
edge node, and an outlet port of the second of the optical switch modules, used to transmit the data block to a second 
edge node and performing a third-order matching process. The third-order matching process includes examining oc- 
cupancy of the inlet port and the outlet port for sequential time slots until a particular time slot Is found for which both 
the inlet port and the outlet port are vacant, where the examining begins at an initial time slot, where the particular time 
slot is found for which both the inlet port and the outlet port are vacant, assessing occupancy of a second inter-modular 
link connecting the first of the optical switch modules to a third of the optical switch modules and a third inter-modular 
link connecting the third of the optical switch modules to the second of the optical switch modules for the particular 
time slot. 

[0026] According to a further preferred embodiment of the present invention, there is provided an apparatus for 
matching vacant time slots in a plurality of calendars of time slots, each of the plurality of calendars associated with a 
port at one of a plurality of optical switch modules in a modular optical switch. The apparatus includes a request buffer 
adapted to receive a request for allocating a time slot that is vacant in a first of the optical switch modules and a second 
of the optical switch modules, the request specifying an inlet port of the first of the optical switch modules, used to 
receive a data block from a first edge node, and an outlet port of the second of the optical switch modules, used to 
transmit the data block to a second edge node, a path finder adapted to perform a second-order matching process for 
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a given request from the request buffer The second-order matching process includes examining occupancy of the inlet 
port and the outlet port for sequential time slots until a particular time slot is found for which both the inlet port and the 
outlet port are vacant, where the examining begins at an initial time slot and, where the particular time slot Is found for 
which both the inlet port and the outlet port are vacant, assessing occupancy of a first inter-modular link connecting 

5 the first of the optical switch modules to the second of the optical switch modules for the particular time slot and, where 
the first inter-modular link is vacant for the particular time slot, generating a result record identifying the given request 
the particular time slot and the first inter-modular link. The apparatus also includes a connection-control drcuit adapted 
to receive the result record, update the plurality of calendars to result in updated calendars that reflect use of the 
particular time slot to satisfy the request and send the updated calendars to the first of the optical switch modules and 

10 the second of the optical switch modules. 

[0027] In preferred features of this apparatus, a result buffer may be adapted to. upon completion of the time-slot 
matching process, store the result record, a selector may be adapted to fon^^ard result records received from the path 
finder to the result buffer, fonward requests to a subsequent request buffer when a number of allocable time slots is 
less than a specified number and receive release requests. Notably, the time-slot matching process may be a first- 

15 order matching process including examining occupancy of the inlet port and the outlet port for sequential time slots 
until a particular time slot is found for which both the inlet port and the outlet port are vacant, where the examining 
begins at an initial time slot. The time-slot matching process may also be a second-order matching process including 
examining occupancy of the Inlet port and the outlet port for sequential time slots until a particular time slot is found 
for which both the Inlet port and the outlet port are vacant, where the examining begins at an initial time slot and where 

20 the particular time slot is found for which both the inlet port and the outlet port are vacant, assessing occupancy of a 
first inter-modular link connecting the first of the optical switch modules to the second of the optical switch modules for 
the particular time slot, where the first inter-modutar link is vacant for the particular time slot, generating a result record 
identifying the given request the particular time slot and the first inter-modular link. The time-slot matching process 
may further be a third-order matching process including examining occupancy of the inlet port and the outlet port over 

25 sequential time slots until a new particular time slot is found for which both the inlet port and the outlet port are vacant, 
where the examining begins at an initial time slot, where the new particular time slot is found for which both the inlet 
port and the outlet port are vacant, assessing occupancy of a second inter-moduiar link connecting the first of the 
optical switch modules to a third of the optical switch modules and a third inter-modular link connecting the third of the 
optical switch modules to the second of the optical switch modules for the new particular time slot and, where the 

30 second and third Inter-modular links are vacant for the particular time slot, generating a result record identifying the 
given request, the particular time slot and the first inter-modular link. 

[0028] According to a further preferred embodiment of the present invention, there is provided a method of selecting 
a path through a modular optteal switch.. The method includes receiving a request, where the request Identifies a 
requested number of time slots, an inlet port of a first switch module and an outlet port of a second switch module, 

35 responsive to the receiving the request, comparing a state map, specific to a particular time slot, associated with the 
inlet port to a state map, specific to the particular time slot, associated with the outlet port to find a matching time slot 
that is vacant in both the inlet port and the outlet port, if the comparing provides the matching time slot, wherein the 
state maps associated with the respective ports indicate vacancy in the particular time slot, recording the matching 
time slot in a result record and transmitting the result record to a controller of the modular optical switch. 

40 [0029] In preferred features of this method, the comparing may be repeated for a subsequent time slot of a first 
predetennined set of sequential time slots either until the matching time siot has been found or until the state maps 
associated with each of the fiist predetermined set of sequential time slots have been compared. Additionally, respon- 
sive to the receiving the request, a local index may be initialized to a value of the requested number of time slots and, 
where the matching time slot is found, the local index may be reduced by one and an indication of successful matching 

45 may be generated. The comparing, reducing and generating may be repeated for each of the sequential time slots of 
the first predetennined set of sequential time slots either until the local index has been reduced to zero or until the 
state maps associated with each sequential time slot of the first predetermined set of additional time slots have been 
compared. Notably, where each of the first predetermined set of sequential time slots have been compared and the 
local index exceeds zero, the comparing, reducing and generating may be repeated for sequential time slots of a 

50 second predetermined set of sequential time slots either until the local index has been reduced to zero or until the state 
maps associated with each additional time slot of the second predetennined set of sequential time slots have been 
compared. Remarkably, where each of the first predetennined set of sequential time slots have been compared and 
the local index exceeds zero, a request may be output with a requested number of time slots set to a value of the local 
index. 

55 [0030] According to a further preferred embodiment of the present invention, there is provided a path selection ap- 
paratus. The path selection apparatus includes a plurality of matching units, where each of the plurality of matching 
units is adapted to receive a connection request, where the connection request identifies a requested number of time 
slots, an inlet port of a first switch module and an outlet port of a second switch module, responsive to the receiving 
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the time-slot-allocation request, compare a state map, specific to a particular time slot, associated with the inlet port 
to a state map, specific to the particular time slot, associated with the outlet port to find a matching time slot that is 
vacant in both the inlet port and the outlet port and, if the comparing provides the matching time slot, wherein the state 
maps associated with the respective ports indicate vacancy in the particular time slot, record the matching time slot in 
5 a result record. The path selection apparatus also includes a plurality of result buffers, each of the a plurality of result 
buffers adapted to receive a result record from an associated one of the plurality of matching units and a cyclic selector 
adapted to select a single result record at a time from each of the plurality of result buffers under control of the cyclic 
selector 

[0031] In preferred features of this apparatus, each of the plurality of matching units includes: a request buffer for 

JO receiving the time-slot-allocation request; a memory adapted to store the state map associated with the inlet port and 
the state map associated with the outlet port; a path finding processor and a selector Where the path finding processor 
may be adapted to: receive the connection request from the request buffer; access the memory; and perform the 
comparing to result in a given result record. The selector may be adapted to: receive the given result record as a 
product of the comparing; and transmit the result record to the associated result buffer. 

t5 [0032] According to a further preferred embodiment of the present invention, there is provided a data structure for 
simple and compound time-slot matching over a number of time slots to be scheduled, the data structure for use in a 
switch module in a modular switch comprising a plurality of switch modules having inlet ports and outlet ports. The 
data structure includes a first matrix having a number of rows equal to a first product of a maximum number of the inlet 
ports and a maximum number of switch modules and a number of columns equal to the number of time slots, a second 

20 matrix having a number of rows equal to a second product of a maximum number of the outlet ports and a maximum 
number of switch modules and a number of columns equal to the number of time slots, a third matrix having a number 
of rows equal to a third product of the maximum number of the inlet ports and a maximum number of time slots to be 
considered and a number of columns equal to a maximum number of the outlet ports and a fourth matrix having a 
number of rows equal to a fourth product of the maximum number of the outlet ports and a maximum number of time 

25 slots to be considered and a number of columns equal to the maximum number of inlet ports. 

[0033] In pretended features of the data structure, each element in each of the matrices is a 1 -bit word indicating an 
occupancy state. The data structure may be used for a method of first-order time-slot matching that includes finding 
an entry in a row in the first matrix indicating a vacant occupancy state for an inlet port of a given one of the plurality 
of switch modules and finding an entry in a row in the second matrix indicating a vacant occupancy state for an outlet 

50 port of the given one of the plurality of switch modules. The data structure may also be used for a method of second 
order time-slot matching including finding an entry in a row in the first matrix indicating a vacant occupancy state for 
an inlet port of a first one of the plurality of switch modules, finding an entry in a row in the second matrix indicating a 
vacant occupancy state for an outlet port of a second one of the plurality of switch modules and finding an entry in a 
row in the third matrix indicating a vacant occupancy state for an inner link to the second one of the plurality of switch 

55 modules. The data structure may additionally be used for a method of third-order time-slot matching that includes 
finding an entry in a row in the first matrix indicating a vacant occupancy state for an inlet port of a first one of the 
plurality of switch modules, finding an entry in a row in the second matrix indicating a vacant occupancy state for an 
outlet port of a second one of the plurality of switch modules, finding an entry in a row in the third matrix indicating a 
vacant occupancy state for an inner link to a third one of the plurality of switch modules and finding an entry in a row 

40 in the fourth matrix indicating a vacant occupancy state for an inner link to the second one of said plurality of switch 
modules. 

[0034] The summary of the invention does not necessarily disclose all the features essential for defining the invention; 
the invention may reside in a sub-combination of the disclosed features. 

[0035] Other aspects and features of the present invention will become apparent to those of ordinary skill in the art 
45 upon review of the following description of specific embodiments of the invention in conjunction with the accompanying 
figures. 

[0036] There now follows, by way of example only, a detailed description of preferred embodiments of the present 
invention in which: 

50 BRIEF DESCRIPTION OF THE DRAWINGS 

[0037] In the figures which illustrate example embodiments of this invention: 

FIG. 1 illustrates a network of electronic edge nodes interconnected by an optical core that switches entire individual 
55 wavelength channels according to an embodiment of the present invention; 

FIG. 2 illustrates a networi< siniilar to the networi< of FIG. 1, with the exception that the capacity of each link, 
wavelength channel-band or wavelength channel is divided Into units, using time-division multiplexing (TDM) for 
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example, and the optical core switches individual units according to an embodiment of the present invention; 

FIG. 3 qualitatively illustrates the effect of switch granularity on edge-to-edge hopping and overaill network effi- 
ciency; 

5 

FIG. 4 Illustrates a modular core-node structured as a mesh of switch modules according to an embodiment of the 
present invention; 

FIG. 5 illustrates a prior-art cascade of space switches connecting two electronic edge nodes wherein de-coupling 
10 buffers interpose the space switches; 

FIG. 6 illustrates a single space switch in the modular core node of FIG. 4 connecting two electronic edge nodes; 

FIG. 7 illustrates a cascade of two space switches in the modular core node of FIG. 4 connecting two electronic 
15 edge nodes without a decoupling buffer; 

FIG. 8 illustrates a cascade of three space switches in the modular core node of FIG. 4 connecting two electronic 
edge nodes without decoupling buffers; 

20 FIG. 9A illustrates a prior-art multiple first-order time-slot matching process; 

FIG. 9B illustrates a compound second-order or third-order time-slot matching process under moderate time-slot 
packing according to an embodiment of the present invention; 

25 FIG. 9C illustrates a compound second-order or third-order time-slot matching process under ideal time-slot pack- 

ing according to an embodiment of the present invention; 

FIG. 10 illustrates a switch module and switch-module controller for use as a building block in a high-capacity 
optical node according to an embodiment of the present invention; 

30 

FIG. 11 illustrates a modular, high -capacity, optical core node including a master controller that operates in coor- 
dination with the building blocks of FIG. 10 according to an embodiment of the present invention; 

FIG. 12 illustrates a mesh structure with inter-modular connections between switch modules In a modular, high- 
35 capacity, optical core node according to an embodiment of the present Invention; 

FIG. 13 illustrates a signal flow diagram according to an embodiment of the present Invention; 

FIG. 14 illustrates steps of formulating a connection request at a controller of a switch module according to an 
40 embodiment of the present invention; 

FIG. 15 illustrates a data structure for use in the application of a second-order time-slot matching process for the 
mesh structure of switch modules of FIG. 12 according to an embodiment of the present invention; 

45 FIG. 16 illustrates the use of the data structure of FIG. 15 in the application of a third-order time-slot matching 

process for the mesh of switch modules of FIG. 12 according to an embodiment of the present invention; 

FIG. 1 7A illustrates temporal correlation of a first pair of weakly correlated inter-modular links in the mesh structure 
of FIG, 12; 

so 

FIG. 17B illustrates temporal correlation of a second pair of weakly correlated inter-modular links in the mesh 
structure of FIG. 12; 

FIG. 18A illustrates temporal congelation of a first pair of highly correlated inter-modular links in the mesh structure 
55 of FIG. 12 with a first link-occupancy gradient; 

FIG. 1 SB illustrates temporal correlation of a second pair of highly correlated inter-modular links in the mesh struc- 
ture of FIG. 12 with a second link-occupancy gradient; 
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FIG. 19A illustrates an inter-modular link labeling scheme, for inter-modular links outbound from a third switch 
module and inbound to a fifth switch module, in an exemplary five-modute node according to an embodiment of 
the present invention; 

5 FIG. 19B illustrates the inter-modular link labeling scheme, for inter-modular (inks inbound to the third switch mod- 

ule of FIG. 19A and outbound from the fifth switch module of FIG. 19A according to an embodiment of the present 
invention; 

FIG. 20A illustrates a port labeling scheme and associated state map, for port connecting to links inbound from 
10 edge nodes to the switch modules of FIG. 19A, as a matrix, according to an embodiment of the present invention; 

FIG. 20B illustrates a port labeling scheme and associated state map, for port connecting to links outbound from 
to the switch modules of FIG. 1 9A to edge nodes, as a matrix, according to an embodiment of the present invention; 

15 FIG. 21 A illustrates a port labeling scheme and associated state map, for outbound ports of the switch modules 

of FIG. 19A, as a matrix, according to an embodiment of the present invention; 

FIG. 21 B illustrates a port labeling scheme and associated state map, for Inbound ports of the switch modules of 
FIG. 19A to edge nodes, each a transpose of one of the matrices of FIG. 21 A, according to an embodiment of the 
20 present invention; 

FIG. 22 illustrates the control^-data transfer from switch modules to the master controller according to an embod- 
iment of the present invention ; 

25 FIG. 23 illustrates a path selection device using an array of schedulers for a cascaded search according to an 

embodiment of the present Invention; 

FIG. 24 illustrates one of the schedulers of FIG. 23 according to an embodiment of the present invention; 

30 FIG. 25 illustrates a result record generated by the scheduler of FIG. 24; 

FIG. 26 and FIG. 27 illustrate a data structure for use in scheduling a connection in the modular core node of FIG. 4; 

FIG. 28 illustrates state maps as used to select paths in a first-order, second-order, and third-order matching 
35 processes according to an embodiment of the present invention; 

FIG. 29 illustrates steps in a first-order second-order, or third-order matching method to be used by the scheduler 
of FIG. 24 according to an embodiment of the present invention; 

40 FIG. 30 illustrates details of a step of third-order matching to be used by the scheduler of FIG. 24 according to an 

embodiment of the present invention; 

FIG. 31 illustrates a modular switch structured as a mesh of electronic switch modules and optical switch modules 
according to an embodiment of the present invention; and 

45 

FIG. 32 is a table indicating the order of time-slot matching processes in a modular switch comprising electronic 
and/or optical switch modules according to an embodiment of the present invention. 

DETAILED DESCRIPTION 

50 

[0038] The following description is of a preferred embodiment by way of example only and without limitation to the 
combination of features necessary for carrying the invention into effect. 

[0039] A modular optical switch includes a set of optical switch modules connected in a mesh, a master controller 
for the whole optical node and a switch-module controller for each of the optical switch modules. The optical switch 
55 modules receive optical signals from, and transmit optical signals to, edge nodes based on connection requests re- 
ceived from the edge nodes. The master controller acts to select a path, using a simple or compound time-slot matching 
process, through the mesh of switch modules for each optical signal related to a connection request. Advantageously, 
the optical switch modules are fast switching, enabling the use of time-sharing schemes such as TDM, and the modular 
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optical core node is made practical by efficient path selection at the master controller. A hybrid modular switch may 
include both optical and electronic switch modules, a master controller, and a switch-module controller for each of the 

switch modules. 

[0040] A first exemplary network 100 is illustrated in FIG. 1 and includes a number of electronic edge nodes 102A, 

5 102B. 102C. 102D. 102E. 102F, 102G, 102H, 102P, 102Q. 102R, 102S. 102T. 102U. 102V and 102W (referenced 
herein collectively or individually as 102) interconnected by an optical core 106 that comprises a number of optical 
core nodes 104, two of which, 104A and 104B (referenced herein collectively or individually as 104), are illustrated. 
The optical core nodes 104 may have different levels of switching granularities. Each of the electronic edge nodes 102 
in the first exemplary network 100 of FIG. 1 has a small number of fiber links to the optical core 106, each fiber link 

10 carrying wavelength-division-multiplexed (WDM) optical signals, and is hereinafter referenced as a WDM link. The 
electronic edge nodes 102 support traffic sources and traffic sinks (not illustrated). The optical core nodes 104 switch 
wavelength channels, thus an electronic edge node 102 having a number U of WDM links, each link carrying W 
wavelength channels, can reach at most L x W other electronic edge nodes 102 at any given time. FIG. 1 illustrates 
- a case where an example electronic edge node 102U has two WDM links each link carrying two wavelength channels. 

15 At a given instant of time, the four wavelength channels emanating from edge node 102U may be connected to any 
four edge nodes, as indicated by the solid lines connecting edge nodes 102G, 102H, 102R, 102S to the optical core 106. 
[0041] In a high-capacity network, the number of edge nodes can be of the order of a thousand and the number of 
WDM links from an edge node 102 would be of the order of two, with each WDM link carrying 32 wavelength channels. 
Thus the edge node can reach up to 64 other edge nodes at any given time. A typical electronic edge node 102, 

20 however, may require connections to a large number of other electronic edge nodes 102. Such high connectivity can 
then be realized indirectly by multiple edge-to-edge hops, where each edge-to-edge hop requires traversing the network 
core, possibly in several core hops. The electronic edge nodes 1 02 considered in this disclosure are based on electronic 
switches and can, therefore, provide a fine switching granularity. 

[0042] If the number of edge nodes in a given network, similar to that of FIG. 1 is much larger than the number of 
25 channels connecting an edge node to the rest of the network, then the majority of destination edge nodes can only be 
reached by edge-to-edge hopping where the path between two electronic edge nodes 102 traverses intennediate. 
electronic edge nodes 102. 

[0043] FIG. 2 illustrates a second exemplary network 200 similar to the first exemplary network 100 of FIG. 1 . Network 
200 of FIG. 2 has an optical core 206 comprising a number of core nodes 204, two of which 204A and 204B are 
30 illustrated, and differs from the first exemplary network 1 00 in that the core nodes 204 switch individual "switched units". 
These switched units derive from a division of the capacity of each fiber link into smaller units, using, for example, 
time-division multiplexing (TDM). 

[0044] In a TDM scheme, the electronic edge nodes 102 repetitively send TDM frames, of a gh/en number of time 
slots, having a structure predetenmined at a core node 204. A data block is transferred during each time slot. The 
35 Structure specifies the time slots in which a particular source electronic edge node 102 should place data blocks des- 
tined for a particular destination electronic edge node 102. Communication between the source electronic edge node 
102 and the core node 204 is used to establish the TDM frame stmcture. Such a scheme requires precise time coor- 
dination between electronic edge node 102 and core node 204. 

[0045] A related burst switching scheme also requires precise time coordination between electronic edge node 102 
40 and core node 204. In such a scheme, an electronk: edge node 102 indicates, to the core node 204, a desire to transmit 
a large data block. The core node 204 may reply with a time to start sending the data block. 

[0046] The reach index of an edge node in the second exemplary network 200 is greater than that of an edge node 
in the first exemplary network 100 because of the finer granularity. of NeKvork 200. Consider the example electronic 
edge node 1 02U having two WDM links to the optical core with each link canying two wavelength channels as described 

45 above with reference to FIG. 1 . Using a TDM switching scheme where the number of time slots per TDM frame is 128, 
for example, then the two wavelength channels connecting edge node 102U to core node 204A can be divided into 
256 time slots and at a given instant of time, edge node 102U can communicate with all other edge nodes connecting 
to core node 204A. Likewise, edge node 102U can communicate with all other edge nodes connecting to core node 
204B, and some edge nodes can be reached through either or both of core nodes 204A and 204B. 

50 [0047] The use of TDM also enables partitioning the capacity of a light beam, regardless of its wavelength band, into 
optical signal units, each optical signal unit corresponding to a data unit transmitted from an edge node 102 to another 
edge node 102. If an electronic edge node 102 has a relatively large number of fiber links, 1 6 for example, to the optical 
core, and using 1 .024 time slots per TDM frame, the number of optical signal units would be 16,384 and a high reach 
index can be realized for each edge node 102. This eliminates the need for individual wavelength switching. There are 

55 many advantages of such wavelength-unaware switching in the optical core. The use of such a scheme requires timing 
considerations to account for the differing propagation speeds of the constituent wavelength channels in the lime- 
slotted light beam as described in Applicant's United States Patent Applicatton 09/960,959. filed on September 25, 
2001 and titled "Switched Channel-Band Network". 
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[0048] The performance, efficiency and scalability of a teleconnmunications network, comprising nodes interconnect- 
ed by links, depend heavily on the nodal degree and the directly related network diameter. The degree of a specific 
node is a measure of the number of nodes to which the specific node directly connects. A direct connection may be 
continuous or time-slotted. A time slot, or a plurality of time slots, in a time-slotted link also constitutes a direct path 

5 along the link. The diameter of a network is a measure of the maximum number of hops along the shortest path between 
any two nodes. The higher the nodal degree, the smaller the network diameter becomes, and a small network diameter 
generally yields high perfonmance and high efficiency. On the other hand, for a given nodal degree, scalability generally 
increases with the network diameter, but to the detriment of network efficiency. It is therefore advantageous to increase 
the nodal degree to the highest limit that technology permits. To illustrate the implications of nodal degree, reference 

10 is made to FIGS. 1 and 2. The reach index of a core node in the networks of FIG. 1 and FIG. 2 is defined herein as 
the number of edge nodes that can exchange signals through the core node. 

[0049] FIG. 3 qualitatively illustrates the effect of switch granularity on edge-to-edge hopping. Fine switch granularity 
reduces the number of hops and, hence, increases the network efficiency. The advantages of fine switch granularity, 
however, are realized at a cost because increasingly fine switch granularity requires increasingly more elaborate con- 
is trols. Selecting an appropriate granularity is crucial to network efficiency and performance. The inter- relationships 
among the nodal degree, network diameter, core switching latency and network efficiency are illustrated In four quad- 
rants of a plot 300 in FIG. 3, labeled Q301, Q302, Q303 and Q304. The first quadrant Q301 shows the effect of core 
switching latency on the nodal degree. Fast switching (low latency) in the core enables time sharing of links and, hence, 
a higher nodal degree. The nodal degree also increases with node capacity. A high-capacity node can support several 
20 links, each of which may further be time divided. The second quadrant Q302 illustrates the effect of nodal degree on 
network diameter. Naturally, a high degree results in a high direct connectivity and, with proper topology, a smaller 
number of hops. The third quadrant Q303 illustrates the dependence of network efficiency on the network diameter 
where a smaller diameter increases efficiency. Finally, the fourth quadrant Q304 illustrates the enhancement of network 
efficiency with low switching latency and high nodal capacity. FIG. 3 illustrates two cases, case 'A* with a high network 
25 diameter, hence low network efficiency and case 'B' with a high degree, low diameter and high efficiency. Case 'A' in 
FIG. 3 represents a multi-hop network while case 'B' represents a network where the mean number of hops is small. 
Case 'B' enables higher performance and lower network cost. Having an optical core node of high-capacity and a low 
switching latency is therefore highly desirable. FIG, 3 elucidates the benefits of having an optical core node of high- 
capacity and low latency. 

30 [0050] FIG. 4 illustrates a modular optical core node 400 constructed as a modular switch comprising five switch 
modules 404A, 4048, 404C, 404D, 404E, (referenced herein collectively or individually as 404) each switch module 
404 connecting to edge nodes 102 through WDM links 412 and to each other switch module 404 through internal links 
420, each link 412, 420 carrying at least one wavelength channel. Each of the five switch modules 404A, 404B, 404C, 
404D, 404E includes a corresponding space switch 408A, 408B, 408C, 408D, 408E (referenced herein collectively or 

35 individually as 408) for switching signals from input ports to output ports (not illustrated). The input ports are divided 
into inlet ports that receive signals from edge nodes and inbound ports that receive signals from other switch modules 
404. The output ports are divided into outlet ports that send signals to edge nodes and outbound ports that send signals 
to other switch modules 404. Inlet ports and outlet ports receive signals from, and transmit signals to, edge nodes and 
are. therefore, called outer ports. The inbound ports and outbound ports of a switch module 404 receive signals from, 

40 and transmit signals to other switch modules and are, therefore, called inner ports. An internal link 420 connecting a 
first switch module to a second switch module connects an outbound port of the first switch module to an inbound port 
of the second switch module, and the internal link 420 is therefore an outbound link with respect to the first switch 
module and an Inbound link with respect to the second switch module. All internal links 420 are unidirectional. Thus, 
link 420BE carries signals from switch module 4048 to switch module 404E and another link 420EB (not illustrated) 

^ carries signals from switch module 404E to switch module 4048. A link 420 may connect several outbound ports of a 
first switch module 420 to an equal number of inbound ports of a second switch module. Each internal link 420 may 
carry a single wavelength channel era band of wavelength channels. 

[0051] Each edge node 102 has an uplink 412 to a switch module 404 and a downlink 418 from the same, or another, 
switch module 404 of the modular core node 400. An uplink 412 or a downlink 418 may carry several wavelength 
so channels. A given edge node 102 may have more than one uplink to the modular core node 400 and more than one 
downlink from the modular core node 400. For a given edge node 102, the number of uplinks need not equal the 
number of down links and an edge node may choose to send control signals to the modular core node along only one 
channel in an uplink 412 and receive control signals through one channel of a downlink 418. 

[0052] If each uplink 412 and each downlink 418 carry multiple wavelength channels, the modular core node 400 
S5 may either switch time-slotted multi-channel signals in a single space switch 408, or use a number of space switches 
408 per switch module 404. A multi-channel signal occupies a band of wavelength channels as described in the afore- 
mentioned United State Patent Application 09/960,959. The present disclosure is based on the generic case of switch- 
ing time-slotted multi-channel signals. The use of a multi-channel signal of course covers the case of a single-channel 
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signal. 

[0053] A modular switch operating in a TDM mode receives signals from electronic source nodes and a time-slot 
switching stage takes place at each source node. If each switch module in the modular switch is an optical switch, the 
time slots selected for a given connection request may be switched to other time slots at both the source node and 
s sink node. Time-alignment of source nodes and a corresponding optical switch, module is enabled by a time-focking 
process to be described hereinafter. 

[0054] This mesh structure is used to increase the number of inlet and outlet ports of the modular core node. The 
modular core node 400 can be used in either a channel-switching mode, as described with reference to FIG. 1, or in 
a TOM switching mode as described with reference to FIG. 2. The benefits of using a high -capacity core node were 

10 described above with reference to FIG. 3. 

[0055] Preferably, each switch module 404 would have a direct internal link 420 to each other switch module 404. 
The traffic volume from a switch module 404A to a switch module 404B may be consistently higher than the capacity 
of the direct link from switch module 404A to switch module 404B. It is preferable in this case to reserve the internal 
link from switch module 404A to switch module 404B for the exclusive use of the 404A-404B traffic. The 404A-404B 

'5 traffic may still overflowto other routes through Intemiedrate switch modules, where each connection requires traversing 
two internal links 420. 

[0056] Hereinafter, the term "calendar" Is used to refer to an array having as many cells as the number of time slots 
in a TDM frame, where each cell (each entry) contains an indication of the state, busy or free, of a corresponding time 
slot in the TDM frame. A cell (entry) in a calendar is said to be busy or occupied if the corresponding time slot is 
20 reserved. A calendar may be associated with an input port, an output port, an uplink, a downlink, or an intemal link in 
a modular switch. The process of "comparing" two or more calendars is a process of examining con'esponding cells 
(entries) in the compared calendars to determine If the examined cells (entries) are ail free (in a vacant state), thus 
yielding a matching time slot in a corresponding TDM frame. 

[0057] The absolute occupancy of a calendar is an integer number of the number of busy (occupied) cells and the 
25 absolute vacancy is the number of free (vacant) cells in the calendar. The relative occupancy of a calendar is the 
absolute occupancy divided by the number of cells in the calendar and the relative vacancy is the absolute vacancy 
divided by the number of cells in the calendar. For brevity, the terms occupancy and vacancy are used to indicate, 
respectively, the relative occupancy and relative vacancy. The term "occupancy state" of acalendar refers to the number 
of occupied cells and the distribution of the occupied cells within the calendar. 

30 

Path allocation 

[0058] Consider a modular core node of an arbitrary structure and having N input ports, M output ports, N > 1 and 
M > 1 . The input ports transfer signals to output ports through internal links within the modular core node. A route set 

35 Is designated for transferring signals from each input port to each output port. The number of routes in a route set can 
vary from a single route to several routes, and a route may traverse a single link or a concatenation of at least two 
links. The modular core node has a controller that includes a route-set memory storing NxM route sets, one route set 
for each pair of input and output ports. The routes In the NxM route sets naturally intersect, and a particular link may 
be included in a large number of routes. The routes in a route set may be classified according to some merit such as 

40 the number of links per route and, naturally, a route having the least number of links is preferred and would be selected 
if it has a sufficient vacancy to accommodate a connection request. Each connection request includes a connection 
identifier and specifies an input port, an output port, and a requested number q of time slots in a slotted time frame. 
[0059] !n both channel switching and TDM switching, it Is desirable to equalize the vacancy of the internal links in 
order to increase the throughput, given the varying spatial distribution of the traffic load. Vacancy equalization can be 

45 realized by selective routing, where two or more routes of comparable merit (for example, the same number of links 
per route) would be designated as candidate routes and when two or more routes have a sufficient vacancy to accom- 
modate a connection request, the route with the highest vacancy would be selected. Selective routing can, however 
be computationally intensive. A simple, yet effective, technique is to select the routes according to merit classifrcation 
and, within each merit classification, select a route of sufficient vacancy in a cyclic manner. 

50 [0060] If the modular core node is operated in a time-division-multiplexing (TDM) mode, It is desirable that the oc- 
cupancy of each route be packed in the TDM frame so that routes have a high probability of being occupied during 
time slots near a selected reference time slot (usually the start of a TDM frame) and have a high probability of being 
vacant during time slots away from the selected reference time slot. This temporal packing discipline is in contrast with 
the spatial load-equalization discipline described above which attempts to equalize the vacancy across the internal 

55 links. 

[0061] The route-selection discipline used for the modular high-capacity optical core node according to the present 
invention is based on: 
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(1) favoring routes having a least number of links; 

(2) spatial load-equalization of routes of the same number of links; and 

5 (3) temporal packing. 

[0062] Because each input port In the modular switch may transmit to many output ports, hence each output port 
may receive from many input ports, vacant time slots at a given pair of input port and output port may not be aligned. 
Mismatch occurs where vacant time slots in a multi-link route are not aligned. Temporal packing significantly reduces 

10 the mismatch probability. However, this is realized at the expense of an extensive search effort because the search 
must start from the same reference time slots for alt connection requests and the required number of vacant time slots 
are more likely to be found near the end of the TDM frame period. If the number, S, of time slots per TDM frame Is 
1 ,024, and with a high mean occupancy of 0.90, for example, a large proportion of connection requests would require 
scanning more than 800 time slots across ail links of a candidate route. This can significantly reduce the scalability of 

IS the modular core node. 

[0063] To circumvent this difficulty, the scheduling method in accordance with the present invention divides the TDM 
frame into a number of sub-frames, and uses a cascade of schedulers each of which operating on a specific sub-frame. 
The sub-frame need not be of equal duration . Using H cascaded schedulers, where H > 1 , a connection request requiring 
q > 0 time slots is offered to a head scheduler which attempts to find matching time slots within the first sub-frame and 

20 relays the connection request, with the pending number of time slots, to a second scheduler if the pending number is 
greater than zero. The second scheduler attempts to find matching time slots along routes in the route set and relays 
the connection request to a third scheduler if the pending number of time slots is not zero, and so on. This permits 
simultaneous operation of all schedulers where the schedulers would be processing different connection requests. 
[0064] The method requires that each scheduler be provided with a result memory to hold allocated time slots within 

25 a respective sub-frame, and a cascade memory to hold the parameters of a connection request to be relayed to a 
subsequent scheduler, if any. A result selector cyclically visits the result buffers of the H schedulers to read the records 
of allocated time slots. 

[0065] The routes in each route set may be classified into a number, r, of categories, r > 1 , where the 7**^ category, 
1 < y< r includes routes requiring a corresponding order of time-slot matching process. A route set being defined for 
30 each pair of input and output ports. For each connection request specifying an inlet port and an outlet port that belong 
to different switch modules, the entire cascaded scheduling process described above would be applied to one category 
to determine allocable time slots and if the allocable number of time slots is less than the required number, the cascaded 
scheduling process may be applied to another category. 

[0066] The categories are preferably sorted according to a descending order of preference, the first category being 
35 the most preferred. 

[0067] If the pending number of time slots is not zero after all sub-frames of the TDM frame have been considered, 
i.e., after the last of the H schedulers has completed its time-slot matching process, the cascade-scheduling process 
is repeated, starting with the head scheduler and using routes classified to be in the next preferred category, if any. 
[0068] If the core node is a single-stage space switch, there is only one route for each pair of input and output ports, 

40 and r, therefore, equals 1 . If the core node is a multi-stage (modular) core node employing space-switching modules, 
then the value of r depends on the maximum number of switch modules to be traversed by a route in the route set. 
[0069] In an unfolded three-stage modular core node, each route traverses three switch modules. There is only one 
category, I.e., r = 1 , requihng a third-order time-slot matching process. A route set for each pair of input-output ports 
may include numerous routes, each requiring a third-order matching process. 

45 [0070] In a folded three-stage modular core node (not illustrated), each switch module in a first array of switch mod- 
ules serves both as a first stage and a third stage switch module, and each switch module in a second array of switch 
modules serves as a second-stage switch module. A path between an input port and an output port belonging to the 
same switch module in the first array of switch modules requires a first-order time-slot matching process. A path be- 
tween an input port in a first switch module in the first arrays of switch modules and an output port in a second switch 

50 module in the first array of switch modules requires a third-order time-slot matching process. Thus, there are only two 
route categories (r = 2) within some route sets including only a single path each within a space switch and each of the 
remaining route sets including numerous candidate paths through the switch modules of the second array of switch 
modules, and allocating each candidate path requires a third-order time-slot matching process. 
[0071] In a double-folded three stage modular core node, where each switch module in a single array of switch 

55 modules serves as a first-stage switch module, a second-stage switch module, and a third-stage switch module, there 
are also at most two categories (F = 2) in a route set. The switch modules in a double-folded three-stage switch modular 
core node are futly-meshed as illustrated in FIG. 4. A route set for pairs of input-output ports belonging to the same 
switch module includes only one internal path, through a switching module, and requires a first-order matching process. 
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A route set for pairs of input-output ports belonging to different switch modules includes a direct path connecting the 
two switch modules and (N-2) paths each traversing one of the switch modules, other than the switch modules sup- 
porting the input and output ports specified in the connection request, and requiring a third-order matching process to 
allocate. 

5 [0072] State arrays and matrices, to be described below, are needed to facilitate the path scheduling process. Upon 
completion of the cascade scheduling process for all route categories in a route set, one of two policies may be adopted 
if the allocable number of time slots is still less than required. In one policy, the corresponding connection request 
would be rejected, the allocable time slots would be released and the corresponding entries in the state arrays and 
matrices used for path allocation would be reset to a free state. In an alternate and preferred policy, the allocable time 

'0 slots would be communicated to the switch-module controller which may accumulate the allocable time slots according 
to input-output port pairs and grant admission to any of waiting connection requests. For example, if a connection 
request, fonwarded by a given switch module to a master controller of a modular core node, specifies 1 8 time slots and 
only 14 can be allocated, the switch module may grant four of the allocable time slots to a new connection request, 
retain the remainder, and send the master controller a connection request specifying only eight time slots. This process 

^5 can be carried on continually, where a connection request may be delayed until an accumulated number of allocable 
time slots for a pair of input and output ports is sufficient. Naturally, when a connection Is terminated, the switch-module 
controller instructs the master controller to release the corresponding resources, i.e., to reset corresponding entries in 
the state arrays and state matrices to a free state. The state arrays and matrices wilt be described below. 
[0073] FIG. 5 illustrates a path in an exemplary prior art core node 504. The path traverses a first switch module 

20 SOSH', a second switch module 5060 and a third switch module 506A {referred to collectively or individually as 506). 
The optical core node 504 is used to connect a first electronic edge node 102 J to a second electronic edge node 102W. 
Each switch module 506 includes a respective space switch 5084^, 5089, 508A. Each switch module 506 along the 
illustrated path from the first electronic edge node 1 02J to* the second electronic edge node 1 02W performs a switching 
stage. To eliminate the need for strict time-coordination, an inter-module de-coupiing buffer 51 OH' is provided to inter- 

25 pose the space switch SOSH' at the first switch module 5064^ and the space switch 5080 at the second switch module 
5060. Additionally, an inter-module de-coupling buffer 5100 is provided to interpose the space switch 5080 at the 
second switch module 5060 and the space switch 508A at the third switch module 506A. These inter-module de- 
coupling buffers 5104^, 5100 facilitate path allocation. The availability of inter-module de-coupling buffers 5104^, 5100 
interposing the bufferless switch modules 506 greatly simplifies the scheduling process because a path traversing all 

30 the space switches can be established as a concatenation of individual paths that are independent from each other. 
This reduces the path allocation process to a number of simpler independent matching processes. Providing such 
decoupling buffers is not currently feasible for optical switch modules without resorting to optical-to-e!ectrical conversion 
and electrical-to-optical conversion. 

[0074] FIG. 6 illustrates a first path In the modular core node 400 of FIG. 4. The path traverses a single switch module 
35 404B. The modular core node 400 is used to connect a first electronic edge node 102J to a second electronic edge 
node 102K, both connecting to the sameswitch module 404B. Recall, from FIG. 4, that the switch module 4048 includes 
a space switch 408B. Strict time-coordination with edge node 102 J is required due to the absence of a receiving buffer 
at the switch module 404B. For TDM switching, a simple first-order time-slot matching process, well known in the art, 
can be used for allocating time slots requested for a connection. Note that a time-slot interchange takes place at both 
40 edge nodes 102J and 102 K. 

[0075] FIG. 7 illustrates a second path, in the modular core node 400, that traverses a first switch module 404B and 
a second switch module 4045. The modular core node 400 is used to connect a first electronic edge node 102J to a 
second electronic edge node 102V. Each switch module 404 includes a space switch 408. Each switch module 404 
along the illustrated path from the first electronic edge node 102J to the second electronic edge node 102V perfonms 
45 a switching stage. Strict time-coordination with edge node 102J is required, as in the connection of FIG, 6, and, in 
addition, a compound second-order time-slot matching process (described in detail hereinafter), in accordance with 
an embodiment of the invention, is required for TDM switching. A time-slot interchange takes place at both edge nodes 
102 J and 102V. 

[0076] FIG. 8 illustrates a third path in the modular core node 400. The path traverses the first switch module 404B, 
50 a third switch module 404D and the second switch module 404E. The modular core node 400 is used to connect a first 
electronic edge node 102J to a second electronic edge node 102W. Each switch module 404 includes a space switch 
408. Each switch module 404 along the illustrated path from the first electronic edge node 1 02J to the second electronic 
edge node 102W perfomns a switching stage. Strict time-coordination with edge node 102J is required, as in the con- 
nection of FIG. 6, and, in addition, a compound third-order time-slot matching process (described in detail hereinafter), 
55 in accordance with an embodiment of the invention. Is required for TDM switching. A lime-slot interchange takes place 
at both edge nodes 102 J and 102W. 

[0077] As discussed above in conjunction with FIG. 5, a path traversing two or more space switches operated in a 
TDM mode with Interposing data buffers can be determined by independently establishing a path within each space 
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switch according to a first-order matching process, and the nriatching time-slot selected m the space switches need 
not be contemporaneous because time-slot interchange at the interface of two successive space switches is feasible. 
[0078] Thus, in a network of electronic switches, a compound matching process of an order G, where G is greater 
than unity, can be decomposed into G stages, each stage requiring a first-order matching process. This requires data 
5 buffering between successive stages as illustrated in FIG. 5. In a network of optical switches, data buffering is not 
feasible and a compound matching process of order G requires concurrent time slot availability in (G + 1 ) ports as will 
be detailed below. 

[0079] In a first-order matching process^ a connection requesting a single time slot or multiple time slots requires 
each time slot to be free in two corresponding ports. With a compound, second-order matching process, a connection 
10 having multiple time slots requires each time slot in the connection to be free in three corresponding ports and with a 
compound, third-order matching process, a connection having multiple time slots requires that each time slot be free 
In four corresponding ports. 

[0080] First-order matching has been used extensively in circuit switching and the perfonnance of first order matching 
is well known even with multiple-time-slot connections. It is also known that, with a sufficiently large ratio of the number 
15 of time slots per calendar to the largest number of time slots per connection, good perfomiance of first order matching 
can be realized at a high throughput (see for example, Beshai et a/., "Multichannel Sen/Ices: Perfonmance of Switching 
Networks". Proceedings of the International Teletraffic Congress, ITC 12, June 1988). 

[0081 ] The probability of successful matching decreases rapidly as the number of path segments in a path increases, 
i.e., as the order of the time-slot-matching process increases. 

20 

Connection-request blocking 

[0082] In a first-order matching process of two calendars, it is well known that zero blocking of connection requests, 
each specifying a single time slot, is realized when the absolute occupancy of each calendar, i.e., the number of oc- 
25 cupied cells in a calendar, is limited to a value C so that the number, S, of time slots per calendar equals or exceeds 
(2XC-1). 

[0083] In a matching process of J calendars, zero blocking of connection requests each specifying at most v time 
slots, v^1 , can be realized if the absolute occupancy of each of the J calendars is limited so that 
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where Xj is the maximum absolute occupancy of calendar]. 

With Xi =X2= =Xj = C, the maximum absolute occupancy, C, is limited so that 



40 



C<(S^(J'1)\f)iJ ,\>2, 



For example with S = 1 ,024. J = 4, and v = 8, C Is limited to 262, i.e. , the maximum relative occupancy of each calendar 
would be about 0.26. This low utilization is of course unacceptable and means for realizing high utilization, exceeding 
45 0.85 for example, at a negtigtbte connection -request blocking are needed. 

Time-slot matching process 

[0084] In abufferless single-stage TDM switch having a number inlet ports and outlet ports, a calendar is associated 
50 with each inlet port and each outlet port. Each calendar has the same number, S, of time slots. When the controller of 
the TOM switch receives a connection request, the request specifying an inlet port, an outlet port, and a number of 
time slots per calendar, the controller of the TDM switch compares the calendar associated with the specified inlet port 
with the calendar associated with the specified outlet port to find matching time slots that are vacant concurrently in 
the two calendars. If the number of calendars Is large, over 16 for example, the occupancy states of the calendars 
55 become weakly correlated and can be treated as uncorrelated. 

[0085] The computation of the distribution of the number of matching time slots in any two calendars is quite tedious, 
even when connection requests an-ive at random and under the assumption that the occupancy states of all calendars 
are uncorrelated, as described in the paper titled "Multichannel Services: Perfonmance of Switching Networks", refer- 
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enced hereinbefore. The computation requires deterniining the nnatching time slots for each pair of occupancy states 
of the t\A/o calendars. 

[0086] When one calendar has m free time slots and the other has n free time slots, and with random distribution of 
the free time slots within each calendar, the probability of not finding any matching time slot, which is the probability 
5 Of blocking a connection request specifying only one time slot, is detemiined as; 
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The overall mean blocking is then determined by taking into account the statistical distribution of both m and n. 
[0087] Expression (2) can be approximated as: 



2S-m+l) 



20 where m>0,n>0,m<n, and m + n < (S + 1 ). 

[0088] In a bufferless multi-stage TDM switch, a connection may traverse two or more single-stage switch modules, 
thus requiring compound time-slot matching. A calendar is associated with each inlet port, and each outlet port of each 
switch module, and a calendar is associated with each inner link connecting two switch modules. Each calendar has 
the same number S, of time slots. When the switch controller receives a connection request, the request specifying 

25 an inlet port, an outlet port, and a number of time slots per calendar the switch controller compares the calendar 
associated with the specified inlet port with the calendar associated with the specified outlet port and the calendars 
associated with any traversed inner links to find matching time slots. 

[0089] The number of calendars to be examined for any connection request Is denoted J, and the order of a com- 
pound-matching process is (J-1). In the modular switch of FIG. 4, the maximum value of J is 4, and a connection may 
30 require a first-order, second-order, or third-order matching process. 

[0090] The computation of the probability of blocking when more than two calendars must be compared (J > 2) 
becomes unwieldy. The probability of not finding any matching time slots In J > 2 calendars for a given occupancy state 
where is the number of free time slots in calendar j,1 5 j < J. can be approximated as: 



i-n( 



(4) 



where 0 < ^j< ( S - (4,-1 )/2), 1 < j S J. Without loss of generality, 4, is selected to be the least of {4,, ... 
[0091] The above blocking probability applies only for a given occupancy state and for the case where each con- 
nection request specifies only one time slot. To detemiine the mean blocking, the distribution of q,, ^, must be 
^ taken into account. However, despite its limitation, this approximation is useful for illustrating the effect of multi-stage 
compound matching on connection-request blocking, and, for this purpose, it would be sufficient to consider the occu- 
pancy state where each of the J calendars to be compared for a connection request has the same number, n, of vacant 
time slots, i.e.,^i =^3= .. =^ = n. The probability of not finding any matching time slots in the J calendars is then; 



[0092] Consider, for example, the case where S = 1 28, with a mean vacancy of 0.20, i.e., about 25 expected vacant 
ss time slots within each calendar Using n = 25 in expression (5) to detennine the blocking probability, under the conditions 
described above, yields: 

P= 0.00232 for J = 2 (first-order matching), 
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P= 0.30452 for J = 3 (second-order matchmg), and 

p= 0.7776 for J = 4 (third-order nnatching). 

5 Using S = 1.024. and n = 200 (i.e., the same relative occupancy) yields values of P of 6.7x10*22, 6.87x10-5, and 

0.13085 for J = 2, 3, and 4, respectively. 

[0093] The blocking for connection requests specifying multiple time slots per calendar period would be higher than 
the above values. For each inlet port in the modular switch of FIG. 4, there are (N-2) paths that may use third-order 

10 matching, but only a single path that uses second-order or first-order matching. The blocking shown for J = 4, is based 
on examining only one of the {N-2) path and would be reduced when more than one path are attempted. 
[0094] With time-slot packing, the blocking values under the same load condition, would be much lower than the 
above values. Computing the blocking with time-slot packing is very difficult to determine analytically 
[0095] A parameter of interest, specially for connection requests specifying multiple time slots per calendar, is the 

15 mean value of the number of matching time slots. With Gj denoting the relative vacancy in a calendar j, 1 < j ^ J, the 
mean value, \i, of the number of matching time slots in J calendars can be approximated by: 



20 ^=Sxn (6) 

[0096] Selecting a reference occupancy state where each of the J calendars has the same number x of vacant time 
slots, resulting in a relative vacancy 0 = t / S, with 

25 

then the mean number of matching time slots is determined as fx = S x 0^. 
30 [0097] The probability of zero matching and the mean number of matching time slots under a given occupancy state, 
indicated by the number of free time slots per calendar are tabulated below for selected values of S and J. 



35 



Time Slots Per Calendar 


Free Time Slots 


Probability Of Zero Matching 


S 


T 


J=2 


J=3 


J=4 


128 


25 


0.0023 


0.3045 


0.7776 


256 


50 


5.2x1 0-s 


0.09176 


0.6027 


512 


100 


2.6X10-11 


0.00833 


0.3621 


1,024 


200 


6.7X10-22 


6.87X10-5 


0.1307 



Time Slots Per Calendar 


Free Time Slots 


Mean Number of Matching Time Slots 


S 


T 


J -2 


J =3 


J=4 


128 


25 


4.882 


0.953 


0.186 


256 


50 


9.764 


1.906 


0.372 


512 


100 


19.528 


3.812 


0.744 


1,024 


200 


39.056 


7,624 


1.488 



[0098] Given the low mean number of matching time slots with J = 4, even with a significantly large number of time 
slots per calendar (S = 1 ,024, for example), it Is imperative that connection requests specifying a relatively large number 
of time slots per calendar period use more than one path through an intermediate switch module when a direct link 
requiring second-order time-slot matching does not have a sufficient number of matching time slots. There are (N-2) 
such paths and the required time slots for a connection may be distributed among all the (N-2) paths. It is noted that 
the delay within the modular core node is negligible and, hence a connection using a combination of direct and indirect 
paths will still have aligned time slots at the outlet port. 
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Balancing the loads among calendars 

[0099] The calendars are divided into three groups associated with inlet ports, outlet ports, and inner links. An inner 
link connects an outbound port of a switch module to an inbound port of another switch module. Two measures can 
be taken to maximize the utilization of the ports. The first measure is to equalize the occupancies of the calendars 
associated with the inner links. This can be realized by selecting a candidate route from routes of the same category 
within each route set in a cyclic manner for consecutive time-slot-allocation attempts. The second measure is to pack 
the time slot allocation by examining the time slots starting with a reference time slot. 

[0100] FIG. 9, which comprises FIGS. 9A, 9B, and 9C, illustrates four time-slotted frames, also called calendars, 
each having a number, S. of cells, each cell con-esponding to a time slot of a predefined duration and containing a 1- 
bit indication of the cell's free or busy state. Each calendar corresponds to a link, or a port, in a multi-stage (modular) 
core node. Calendars for four links labeled A, B, C, and D are illustrated. An allocable time slot in a path traversing 
two links is a time slot that is vacant in both links. An allocable time slot in a path traversing three links is a time slot 
that is vacant in the three links. Likewise, an allocable time slot in a path traversing four links is a time slot that is vacant 
in the four links as described earlier. The process of finding an allocable time slot across two links is called a first-order 
matching process. The process of finding an allocable time slot across three links is called a second-order matching 
process. The process of finding an allocable time slot across four links is called a third-order matching process. In a 
switching network, or in a modular core node, a first-order matching process corresponds to a path traversing one 
space switch, a second-order matching process corresponds to a path traversing two space switches, and a third- 
order matching process corresponds to a path traversing three space switches. A path traversing any number of space 
switches may require several time slots per calendar period. As would be appreciated by a person skilled in the art. 
the probability of finding an allocable time slot decreases significantly as the number of traversed space switches 
increases. 

[0101] FIG. 9A illustrates a case where the search for an allocable time slots is conducted at random, or according 
to a simple cyclic discipline where the search for an allocable time slot starts from the same time slot at which the 
search for a preceding allocable time slot ends. This search approach is suitable for finding a path in the core node 
504 of FIG. 5, because the provided buffers 510 enable decomposing a second-order matching process Into two first- 
order matching processes, and a third-order matching process into three first-order matching processes. 
[0102] In the bufferless modular core node 400, a path across the node may traverse a single switch module 404, 
two switch modules 404, or three switch modules 404. The preferable path from an inlet port of a first switch module 
404 to an outlet port of a second switch module 404 traverses a single inter-modular link 420, if such a link is provided, 
and requires a second-order matching process. In addition, there are at most (N-2) paths each traversing an interme- 
diate switch module 404 and requiring a third-order matching process to allocate a time slot. 

[0103] FIG. 9C illustrates the occupancy of the four calendars, A, B, 0, and, D if an ideal time-slot packing process 
can be found, so that all the busy time slots in each calendar occupy consecutive cells. This leads to an alignment of 
vacant time slots that greatly increases the probability of finding allocable time slots. This occupancy pattern is not, 
however, realizable without a significant processing effort. A realizable occupancy pattern is visualized in FIG. 98, 
where the busy time slots in each calendar tend to create a busy zone close to one end of the calendar and the vacant 
time slots tend to create a vacancy zone near the other end, with a small proportion of busy time slots infiltrating the 
vacancy zone and vice versa. 

[0104] The occupancy pattern of FIG. 9B can be realized by using a simple packing process, where the search for 
an allocable time slot always starts from a reference time slot (reference eel!) in each calendar. This packing technique 
has been used extensively for first-order matching processes and its extension to compound matching processes 
(higher-order matching processes) should be obvious to a person skilled in the art. However, finding efficient means 
for real-time compound matching implementation, where a path allocation process must consume less than a micro- 
second, for example, of processing time, is a difficult task that requires new methods of real-time path scheduling. The 
method of the present invention will be described below with reference to FIGS. 22 to 25. 

[0105] Two measures can be taken to enable multi-stage connection in the absence of inter-stage buffers. The first 
is to use inner expansion and the second is to use spatial load balancing and temporal load packing. Both are described 
below. 

inner-expansion in a space switch 

[0106] The modular core node 400 includes a plurality of switch modules 404, each of which is non-blocking in the 
sense that any free input port (inlet port or inbound port) can connect to any free output port (outlet port or outbound 
port). An input port or an output port is said to be free if its vacancy equals or exceeds a switched data unit. Likewise, 
an outer link or an inner link is said to be free if its vacancy at least equals a switched data unit. A switched data unit 
is a channel, in a channel-switching switch module, or a time-slot in a TDM frame, in a TDM-switching switch module. 
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In both channel switching and TDM switching, a request for a connection from a free infet port in a switch module 404A 
to a free outlet port of another switch module 404B can be blocked even if switch module 404A has several free inner 
links emanating from its outbound ports and switch module 404B has several free terminating free links to its inbound 
ports. Blocking can happen due to spatial mismatch of the emanating and terminating free links. In TDM switching, 

5 blocking can also occur due to temporal mismatch of vacant time slots in spatially matched links. 

[0107] The mismatch probability naturally decreases with the decrease of traffic occupancy. In channel switching, if 
the number of vacant inner channels of the switch modules is large, the probability of spatial channel mismatch de- 
creases. Likewise, in TDM switching, if the number of vacant time-slots in the inner channels is large, the probability 
of temporal mismatch decreases. Thus, in order to reduce the mismatch probability in space switches, a common 

10 practice in the art is to provide more inner channels than outer channels. This automatically forces a reduced inner 
occupancy. The penalty, however, is a reduced utilization of the space switch. In a first-order matching process, it is 
well known that mismatch can be entirely eliminated if the number of inner channels is slightly lower than double the 
number of outer channels (the well-known Closs rule). In a higher-order matching process, the ratio of inner channels 
to outer channels required to eliminate blocking would be higher than two, as described earlier. If rather than eliminating 

IS mismatch blocking the objective is to reduce the mismatch probability to an acceptable value, 0.001 for example, then 
a lesser expansion ratio (Inner channels to outer channels capacity ratio) can be applied. However, in second-order 
and higher-order matching processes, the expansion ratio can still be extravagant. To circumvent this difficulty, the 
traffic packing process described above for a first-order matching process can be extended to higher-order matching 
processes. A process is provided herein for global time-slot occupancy packing across all possible internal paths be- 

20 tween each inlet port and each outlet port for which a connection is sought. 

Occupancy Packing 

[0108] Figure 9C illustrates the concept of occupancy packing. A scheduler matrix 940 comprises four calendars, 
25 each calendar corresponding to a link. Each column corresponds to a link in a three-stage path. Each Row in a scheduler 
matrix 940 corresponds to a time slot in a TDM frame. A three-stage path is a path that traverses three switching 
modules. An entry 962 in the scheduler matrix 940 corresponds to a time slot in a link. A shaded rectangle in an entry 
indicates that the corresponding time slot is already reserved for a connection. 

[0109] The columns in the scheduler matrix 940 are labeled "A", "B", "C, and "D". Referring to FIG. 8, column "A" 
30 corresponds to an ingress link (uplink) 41 2J from a source edge node to an inlet port of afirst switching module. Column 
"B" corresponds to an inner link 420BD from an outbound port of said first switching module to an inbound port of a 
second switching module. Column "C" corresponds to an inner link 420DE from an outbound port of said second 
switching module to an inbound port of a third switching module. Column "D" corresponds to an egress link (down link) 
41 8W from an outlet port to a sink edge node. Each of the schedule matrices illustrated has the same number of 
35 occupied entries 962. The occupied entries in the scheduler matrix 940-1 are randontiy scattered while the occupied 
entries in the scheduler matrix 940-3 are perfectly packed so that the free entries are perfectly aligned. The scheduler 
matrix 940-2 illustrates realistic packing of the occupied entries while the scheduler matrix 940-3 illustrates ideal pack- 
ing. 

[0110] Consider a three stage connection from an outer link ingress link (Column "A") to an egress link (Column "D"), 
40 the connection traversing three switching modules. With inter-module buffers 51 0, as depicted in FIG. 5, the connection 
can be established using three first-order matching processes. In the scheduler matrix 940-1, the connection can be 
established in three parts. The jump from row 6 to row 12 in column "B" and the jump from row 12 to row 1 in column 
"C" are enabled by the use of the inter-m.odule buffers 510^* and 5100, respectively. Without inter-module buffers 51 0, 
the three-stage connection must be established during the same time slot! With the random assignment of entries, as 
45 depicted in the scheduler matrix 940-1, a third-order packing process has a low probability of success, i.e., the prob- 
ability of finding four free entries in the same row, is low, even at relatively low link ulflization. In the realization of the 
scheduler matrix 940-1 , each column has at least four free entries, however, each row has at least one occupied entry 
and a three-stage connection request would be rejected. 

[011 1] With a realistic occupancy packing discipline, as depicted in the scheduler maira 940-2, a third order matching 
50 process has a high probabflity of success. The distribution of occupied entries in the scheduler matrix 940-2 yields two 
opportunities of third-order matching as indicated in time slots 990. The ideal packing of the scheduler matrix 940-3, 
which may be approached with extensive computational effort, offers four third-order matching opportunities. 
[0112] Occupancy packing results in a moderate throughput increase in a multi-stage node having Inter-module 
buffers but a much more pronounced throughput increase in a bufferiess multi-stage node. Occupancy packing is 
55 crucial for compound second-order and third-order matching processes. 
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Time-Locking Definition 

[0113] A first node X is said to be time locked to a second node Y along a given path if, at any instant of time, the 
reading of a time counter at node X equals the sum of a reading of an identical time counter at node Y and the prop- 

5 agation time, normalized to the time-counter period, along the given path from node X to node Y. The time counters 
at nodes X and Y have the same period. There may be several paths connecting the first node to the second node, 
and the paths may be defined by individual wavelengths in a fiber link or several fiber links. Due to the difference In 
propagation delays of different paths connecting the same node pair, time locking must be realized for the different 
paths individually. Due to dispersion, time locking of individual wavelength channels within the same WDM link may 

10 be required. When a first node is time locked to a second node along a given path, the given path is said to be time- 
locked. It is noted that the methods and apparatus of the present invention apply to both channel switching and TDM 
switching. 

[0114] The time-locking process in a time-shared network is described with the help of a two-node model. To realize 
time locking of a first node to a second node in a network, the first node is provided with a first controller that includes 

15 a first time counter and the second node is provided with a slave controller and a master controller that includes a 
master time counter. The second node has several input ports and output ports and the master controller is connected 
to one of the input ports and one of the output ports. The first controller sends an upstream control burst to an input 
port of said second node during a designated time interval, said upstream control burst including a reading of the first 
time counter. The upstream control burst is sent in-band, together with payload data destined to output ports of the 

20 second nodes. The slave controller must be able to direct said upstream control burst to said master controller during 
a pre-scheduled time Interval. The master controller has a device for acquiring and parsing upstream control bursts. 
The master controller compares the reading of the first time counter with a reading of the master time counter. An 
agreement of the two readings, or a negligible discrepancy, ascertains time alignment. The second controller reports 
reading discrepancies to the first controller which resets its time counter accordingly. 

25 [0115] Time locking an edge node to a core node means that a time counter at the edge node is time locked to a 
time counter at the core node. A time counter can be a conventional clock-driven counter. A time counter at an edge 
node is preferably an up-counter and a time-counter at a core node is preferably a down counter, the two counters 
have the same cycle duration. Using a 28-bit time counter, for example, driven by a clock of a dock period of 20 
nanoseconds, the duration of the time-counter cycle would be about 5.37 seconds (228 times 20 nanoseconds). The 

30 reading of an up-counter at an edge node increases, with each clock trigger, from 0 to 268,435,455 (0 to 22^-1 ) and 
the reading of a time counter at a core node decreases, with each clock trigger, from 268.435.455 to 0. If the edge- 
node controller sends a timing message, when its reading is K1 , to a core node, and the reading of the down-counter 
of the core node at the instant of receiving the timing message Is K2, then the edge-node controller must reset its up- 
counter to zero when the up-counter reading reaches [K2 -f K1 ] modulo 2^, B being the wordlength of the time counter 

35 (B = 28 in the above example). If K2 + K1 = 2^ -1 , the edge node is already time locked to the core node. 

[0116] In review, within a network, all time counters have the same period and time-coordination can be realized 
through an exchange of time-counter readings between each source node and its adjacent core node, i.e., the core 
node to which the source node is connected. In TDM switching, the time-counter readings are carried in-band, alongside 
payload data destined to sink nodes, and sending each time-counter reading must be timed to arrive at a corresponding 

40 core node during a designated time interval. 7>ie difficulty of securing time-coordination arises from two interdependent 
requirements. The first is that communicating a time-counter reading from a controller of a source node to a controller 
of a core node requires that the source node be time-locked to the core node, and the second is that time-locking a 
source node to a core node necessitates that a controller of the core node be able to receive a tinie-counter reading 
from the source-node controller during a designated interval of time. To initiate or restore time locking, a secondary 

45 mechanism is therefore required for directing upstream signals received from source nodes toward said master con- 
troller of the core node. 



Time-Loclcing Restoration 

[01 17] In the absence of time alignment, a time-locking recovery procedure must be initiated. The master controller 
sends a downstream control burst to said first controller to indicate the absence of time alignment. In response, the 
first node sends a succession of upstream control bursts each including a reading of said first time counter. Meanwhile, 
the slave controller directs a sample of said upstream control bursts to said master controller during a pre-scheduled 
time interval and the master controller acquires at least one upstream control burst from said sample and sends an 
Identifier of an acquired upstream burst and a corresponding reading of the master time counter to the first controller. 
The identifier may be a serial number of the upstream burst, or a reading of the first time counter included in the 
upstream control burst. The first controller then resets the first time counter accordingly to restore the required time 
locking. During this recovery phase, the slave controller, which controls the connectivity of input ports to output ports 
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of the second node, disconnects all paths to all output ports from the input port of the second node that connects to 
the first node. 

[0118] FIG. 10 illustrates a switch module 1004, to be used hereinafter as a building block in a modular high-capacity 
optical switch. 

5 [0119] Input ports of the switch module 1004 are divided into a set of inlet ports 1016 that connect to source nodes 
(as part of the electronic edge nodes 102) and a set of inbound ports 1006 that connectto other switch modules 1004. 
Each of the Inlet ports 1016 supports an upstream channel-band carrying data from source nodes. A controller input 
port 1010 is used to receive communications from a switch-module controller 1002. Output ports of the switch module 
1004 are divided into a set of outlet ports 101 8 that connect to sink nodes (as part of the electronic edge nodes 102) 

10 and a set of outbound ports 1008 that connect to other switch modules 1004. Each of the outlet ports 1018 supports 
a downstream channel-band carrying data to a sink node. A controller output port 1020 is used to send communications 
to the switch-module controller 1002. 

[0120] The input ports 1006,1016,1010 direct received optical signals to a space switch 1012 for switching toward 
output ports 1008,1018,1020 under configuration control of the switch-module controller 1002. Specifically, configura- 

is tion control of the space switch 1012 is provided in the switch -module controller 1002 by a configuration processor 
1024. The switch-module controller 1002 also includes a time-locking unit 1022 and an associated dock-driven time 
counter 1026. The configuration processor 1024 and the time-locking unit 1022 communicate with the switch module 
1004. In the preferred TDM switching scheme, each uplink, from an electronic edge node 102. arriving at an inlet port 
1016 has a predetermined, time-switched path to the switch-module controller 1002. Similarly the switch-module con- 

20 troller 1002 has a predetermined, time-switched path to the electronic edge node 102. These time-switched paths allow 
for the exchange of timing information between the time-locking unit 1022 and the eiectronk; edge node 102 in a time- 
locking procedure that is detailed in Applicant's United States Patent application, serial number 10/054,509, filed on 
November 13, 2001 and titled "Time-Coordination in a Burst-Switching Network". 

[0121] The configuration processor 1024 communicates configuration commands to the space switch 1012 to direct 
25 optical signals from particular input ports 1006,1 01 6, 1 01 0 to particular output ports 1 008, 1 01 8, 1 020, The configuration 
processor 1024 also exchanges control data with a master controller, to be described with reference to FIG. 11 , via a 
master controller interface 1028. 

[0122] A switch module receives connection requests from edge nodes, interprets each request, generates a time- 
slot-allocation request associated with each connection request, and communicates the time-slot-allocation request 

30 to the master controller of the modular core node. An edge node may send payload signals (payload data) to a modular 
core node through one or more upstream channels (an upstream channel is a channel in an uplink) and receive payload 
signals through one or more downstream channels (a downstream channel is a channel in a downlink). An edge node 
also exchanges control signals with the modular core node The control signals may be carried in-band. alongside the 
payload signals, and a single time slot in an upstream channel and a downstream channel would suffice to carry the 

35 control data. For each connection request, the edge node may select the inlet port and outlet port of the modular core 
node. Alternatively, the master controller of the modular core node may make this decision. 

[0123] A first switch module 1004(0) and a fifth switch module 1004(4) are illustrated in FIG. 11 as but two of a set 
of switch modules 1004 in a modular, high-capacity, optical core node 1100. As illustrated in FIG. 10, each of the switch 
modules 1004 has an associated switch-module controller 1002(0), 1002(4) that connect to a master controller 

40 11 02. The master controller 11 02 includes a time coordination device 11 08, to facilitate time-coordination of the master 
controller 1102 with the individual switch-module controllers 1002, and a connection-control circuit 1104 in communi- 
cation with a path selection device 1106. The time coordination device 1108 and the connection-control circuit 1104 
com.municate with the switch-module controllers 1002 via a switch-module controller interface 1112. The time coordi- 
nation device 1108 includes a time counter as described in application 10/054,509, referenced hereinbefore. 

45 [0124] As described above, each switch-module controller 1 002 has a time-locking unit 1022 in communication with 
a time counter 1 026. In order to enable a time-sharing switching scheme, using, for example, TDM switching, the time 
counters 1026 of all switch-module controllers 1002 must be aligned. To this end, the time coordination device 1108 
communicates with the switch-module controllers 1002 to initialize the respective time counters 1026 (to zero). The 
time counters 1026 are subsequently entrained by the time coordination device 1108 to follow a master time counter 

50 1110. The time coordination devtee 1108 is relatively simple, as internal propagation delays on links from the master 
controller 1 102 to the switch-module controllers 1 002 are either negligible, or are equalized (say, by using equal-lengths 
fiber links). Alternatively, the N switch-module controllers may have a common time-counter. Each edge node exchang- 
es time-locking measurements with the controller of the switch-module to which the edge node connects. Thus, the 
time-slotted signals received at all inlet ports of the modular switch are aligned. 

55 [01 25] The connection-control circuit 11 04 receives time-slot-allocation requests from the switch-module controllers 
1002. These time-slot-allocation requests identify an inlet port 1016 and an outlet port 1018 and a requested number 
of time-slots per scheduling period. The time-slot-allocation requests are then, under control of the connection-control 
circuit 1 1 04, sent to the path selection device 1 1 06. Ports 1 01 0 and 1 020 are provided with Electrk;al-to-Optical (E-O) 
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converters and Optical-To-Electrical (O-E) converters, which are not illustrated but understood to be present. 
[0126] A preferable structure for the modular, high-capacity, optical switch 1100 of FIG. 11 is a mesh structure 1200 
as illustrated in FIG. 12. The mesh structure 1200 is simplified to illustrate only the inner connections in the modular, 
high-capacity, optical switch 1100. In the mesh structure 1200 of FIG. 12, each of the first switch module 1004(0), a 
5 second switch module 1004(1), a third switch module 1004(2), a fourth switch module 1004(3) and the fifth switch 
module 1004(4) directly connects to each other switch module 1 004 by at least one inter-modular link, where the inter- 
modular link may comprise a wavelength channel. 

[0127] To maintain order in the mesh structure 1200 of FIG. 12, each inter-modular link from switch module 1004 to 
switch module 1004 may be assigned a label. As will be expanded upon hereinafter, the link labels relate directly to 

10 labels of the inner ports 1006, 1008 to which the (inner) links connect. 

[0128] In oven/iew, it is very well known in the art that high-capacity switching nodes can be synthesized from lower 
capacity switches using multi-stage structures. For reasons that follow, construction of a multi-stage, high -capacity, 
optical switch differs from construction of known, classical designs in that different, and more efficient, path selection 
methods are required. One reason for the difference is the absence of buffering in optical switches. Another reason is 

IS that there may be a requirement to time-lock the inlet ports 1 016 (or a controller of the inlet ports 1016) of each of the 
switch modules to respective edge nodes (which have buffers). Concurrently, there may be a requirement to time-lock 
to some other external nodes. Additionally, there will likely be a need to frequently reconfigure the paths through the 
various switch modules 1004 of the modular, high-capacity, optical switch 1100 to adapt to traffic changes. 
[0129] Realizing the modular, high-capacity, optical core node 1100 requires recognition of multiple challenges. For 

20 reasons stated above relating to topological reach, the modular, high -capacity, optical core node 1100 is preferably 
operated in a TDM (time-division multiplex) mode. To switch a data stream incoming from an electronic edge node 102 
to another electronic edge node 102, it will often be necessary to find a path from a switch module 1004 that receives 
the data stream to a switch module 1004 that transmits the data stream to a destination. 

[0130] A path segment between the first switch module 1004(0) and the second switch module 1004(1) may be 
25 created by aligning a vacant time slot of a calendar associated with an outbound port 1008 of the first switch module 
1004(0) with a vacant time slot of a calendar associated with an inbound port 1006 of the second switch module 1004 
(1 ). Multiple path segments nnay be required to pass a data stream from the switch module 1004 that received the data 
stream to the switch module 1004 that can transmit the data stream to its destination. Where this vacant-time-slot 
matching is to be perfomned across two or more space switches, there is typically a requirement for an intermediate 
30 buffer (see FIG. 5). However, if this vacant-time-slot matching is to be performed solely in the optical domain, such 
use of an intermediate buffer is not available, as such buffers do not currently exist. 

[0131] In addition to cascading the path finding process in the master controller 1102, each switch -module controller 
1002 should be time-locked with the source nodes (which are part of the electronic edge nodes 102) from which the 
corresponding switch module 1004 receives data streams at the inlet ports 1016. Such time-locking is necessary to 

35 prevent contention at the bufferless core nodes. Time-locking may not be required in a diannel-switched network if it 
may be acceptable that channels be kept idle during the execution of path-setup procedures. 
[0132] As described earlier a first node is said to be time-locked to a second node along a given path, if, at any 
instant of time, the reading of a time-counter at the first node equals the sum of a reading of an identical time-counter 
at the second node and the propagation time, normalized to the time-counter period, along the given path from the 

40 first node to the second node, where the time counters at the first and second nodes have the same period. There may 
be several paths connecting the first node to the second node, and the paths may be defined by individual wavelengths 
in a fiber link or several fiber links. Due to the difference in propagation delays of different paths connecting the same 
node pair, time-locking may be realized for the different paths individually. Due to dispersiorY, time-locking of individual 
paths may be required even for paths defined by wavelengths in the same fiber link. When a first node is time-locked 

45 to a second node along a given path segment, the given path segment is said to be time-locked. 

[0133] Time-sharing switching schemes, such as TDM switching or burst switching, require that each data block, 
whether the data block refers to data carried during a time-slot in a TDM frame or refers to a data burst in a burst 
switching scheme, arrive at an optical switch precisely at a predefined instant of time. This can be realized by time- 
locking the path segment from the electronk: edge node 102 to the modular, high capacity, optical core node 1100. 

30 Restated, in order to be able to switch data blocks arriving at the modular, high-capacity, optical core node 1 100 from 
different electronic edge nodes 102 having different propagation delays to the modular, high-capacity, optical core 
nodes 1100, without contention or the need for data storage at the core node, the electronic edge nodes 102 must be 
time-locked to each switch module 1004 to which the electronic edge nodes 102 send data streams. 
[0134] Each source node (as part of an electronic edge node 102) has at least one time counter and, as stated 

55 hereinbefore, the time-locking unit 1 022 of the switch-module controller 1 002 has a time counter 1 026. All time counters 
have the same period and time-coordination can be realized through an exchange of time-counter readings between 
each source node and its adjacent switch-module controller 1002, i.e., the switch-module controller 1002 associated 
with the switch module 1004 to which the source node is directly connected. The time-counter readings are transmitted 
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in-band, alongside payload data destined to sink nodes, and each tinne-counter reading transmitted must be timed to 
arrive at an adjacent switch-module controiler 1002 during a designated time interval. 

[0135] Difficulty in securing time-coordination may arise from two interdependent requirements. Thefirst requirement 
is that transmitting a time-counter reading from a controller of a source node to a switch-module controller 1002 requires 

s that the source node controller be time-locked to the switch-module controller 1002. The second requirement is that 
time-locking a source node to a switch-module controller 1 002 necessitates that the switch-module controller 1002 be 
available to receive a time-counter reading from the source node controller during a designated interval of time. To 
initiate or restore time-locking, a secondary mechanism is therefore required for directing upstream signals received 
from source nodes toward the switch-module controller 1002. Two such mechanisms are described in the applicant's 

10 US Patent Application No. 09/960,959, the specification of which Is incorporated herein by reference. 

[0136] Where time-locking is the activity perfonned by the time-locking unit 1022 of the switch-module controller 
1002 of FIG. 10, the other illustrated element of the switch-module controller 1002 is the configuration processor 1024. 
As stated above, the configuration processor 1024 acts, through communication with the space switch 1012, to direct 
an optical signal from a particular input port 1006, 1016 to a particular output port 1008, 1018 and exchanges control 

15 data with the master controller 1 102. The configuration of the space switch 1012 is detennined by the configuration 
processor 1024, based on configuration instructions received from the master controller 1102 as the result of a path 
finding process. 

[0137] The internal path finding process is of paramount importance in detemnining the throughput of the modular, 
high-capacity, optical core node 1100. The master controller 1102 receives time-slot-allocation requests from switch- 

20 module controllers 1002 for processing. The path selection process begins by looking for direct paths between switch 
modules 1004, which require a second-order time-slot matching process described hereinafter, before looking for in- 
direct paths, which require a third-order time-slot matching process, also described hereinafter. Advantageously, this 
order of processing may avoid unnecessary diversion of a connection to an indirect path. Note that those intra-switch- 
module paths that do not require traversing inter-modular links can be processed after the results of the second-order 

25 and third-order matching processes are performed. The reason for giving these intra-switch-module paths the lowest 
priority is that these intra-switch-module paths already enjoy a high probability of successful time-matching because 
each requires only a first-order matching process, also described hereinafter. 

[0138] First-order time-slot matching is a process used widely in TDM circuit switches. In this widely used process, 
the signal received at each Input port 1006, 1016 is pre-organized Into a time-slotted calendar and the signal at each 

30 output port is likewise organized. The calendar at all input ports and output ports are Independent but have the same 
duration (scheduling period) and the same number of equal-duration time slots. A time-slot-allocation request specifying 
a given inlet port 1016 and a given outlet port 1018 may require K> 1 time slots per calendar period. To satisfy this 
time-slot-allocation request, the switch-module controller 1002, more specifically, the configuration processor 1024, 
must find K time slots wherein both the given inlet port 1016 and the given outlet port 1018 are vacant, i.e., not used 

35 or planned to be in use. A time slot that is vacant in both the given inlet port 1016 and the given outlet port 1018 may 
be called a "matching time slot". There are several input ports and several output ports and the process of selecting a 
time slot to satisfy a time-slot-allocation request is generally unaware of future time-slol-allocation requests. Thus, it 
is possible that a matching time slot cannot be found even when the respective input port and output port have numerous 
vacant time slots. Such a scenario occurs when none of the vacant time slots in the given inlet port 1016 is aligned 

40 with a vacant time slot in the given outlet port 1018. 

[0139] As described earlier, it is well known in the art that the probability of successful time-slot matching Is greatly 
enhanced by adopting a primitive occupancy packing strategy, wherein the search for a matching time slot from any 
input port to any output port starts from a designated reference time slot and proceeds by examining the time slots of 
the calendar in a prescribed order until the requii-ed number of matching time slots is found or the time-slot-allocation 

45 request is rejected. Other occupancy packing techniques can be marginally more effective than the primitive occupancy 
packing techniques described above. Such techniques generally require more intensive processing, however, and the 
resulting improvement may not justify the required increased computational effort. 

[0140] If a path has to traverse more than one bufferless switch module 1004, the probability of successful time-slot 
matching decreases sharply as the occupancy of the ports traversed by the path approaches unity. The occupancy of 

50 a port is the number of allocated time slots divided by the number of time slots in the calendar. To find K matching time 
slots across A switch modules 1004, A > 1, at least K time slots must be vacant In (A + 1) ports defining a path. In a 
first order matching process, at least K time slots must be vacant in the respective inlet port 101 6 and outlet port 101 8. 
In a second-order matching process, where a path must traverse two switch modules 1004, at least K time slots must 
be vacant in a respective inlet port 1016 in the first switch module 1004(0), an outbound port 1008 of the first switch 

55 modulef 1004(0) (or, equtvalently, a corresponding inbound port 1006 of the second switch module 1004) and a respec- 
tive outlet port 1018 of the second switch module 1004. 

[0141] In a mesh interconnection of N > 2 switch modules 1004, there is a direct path and (N - 2) indirect paths 
between any two space switches, each indirect path comprising two links. A bufferiess two-link path requires a third- 
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order matching process, traversing three switch modules 1004, where a time slot in the path must be vacant at a 
respective inlet port 1016, the two inter-modular links defining the path, and a respective outlet port 1018. There are 
(N - 2) such paths between any inlet port 1016 of a switch module 1004 and any outlet port of another switch module 
1004. The probability of successful matching increases as N increases. The process of examining (N - 2) candidate 
5 paths in four ports, each path requiring finding K vacant time slots in a calendar of S time slots, can be intensive. For 
example, with N = 32, S = 256: the search space includes 7,680 time slots ( (N-2) x S). It is one of the objects of the 
present Invention to develop a technique for overcoming this difficulty. 

[0142] The preferable internal connectivity of the modular high-capacity, optical switch 1100 Is the mesh structure 
1200 as Illustrated in FIG. 12. In order to realize a network of high topological reach, high-capacity optical core nodes 

10 employing time sharing are needed. Trme-sharing switching may be realized by TDM switching or burst switching. 
Time sharing of a fiber link or an individual channel In a fiber link requires fast-switching and fast-switching optical 
switches are typically realizable in relatively small sizes (small number of ports per switch). The mesh structure 1200 
illustrated in FIG. 12 enables the construction of the modular, high-capacity, optical core node 1100 and, despite the 
resulting control complexity, the modular structure 1200 has advantages related to reliability. For instance, failure of a 

15 single switch module 1004 results in only a partial outage within the modular, high -capacity, optical core node 1100. 
[0143] In the mesh structure of FIG. 12, each switch module 1004 directly connects to each other switch module by 
at least one wavelength channel. An internal connection within a switch module 1004 requires only one single time- 
slot matching process (a first-order matching process). A direct connection between two switch modules requires a 
second-order time-slot matching process as described above. Due to typical traffic imbalance, some traffic streams 

20 may have to traverse an intermediate switch module 1004, in which case a third-order time-slot matching process is 
required. It is well known that inner capacity expansion, where the combined capacity of the inner ports 1006, 1008 
exceeds the combined capacity of the inlet and outlet ports 1016, 1018 in each module, can reduce, or even eliminate, 
the incidence of vacancy mismatch of a set of time slots that may fomn a path. Excessive inner-capacity expansion is, 
however, costly and techniques of reducing the mismatch probabilities by appropriate path -allocations are highly de- 

25 sirable. Successful path selection can be realized with a very high probability, without resorting to extensive and costly 
internal-capacity expansion, by appropriate path selection techniques with a slight inner expansion, as will be described 
below. 

[0144] To review the signal flow related to the modular, high-capacity, optical core node 1100, consider FIG. 13. A 
first electronic edge node 102Z communicates a connection request to the first switch module 1004(0) (step 1301). 

30 The connection request may, for instance, specify a second electronic edge node 102U and a number of requested 
time-slots. The connection request is switched from an inlet port of the first switch module 1004(0) to the first switch- 
module controller 1002(0) (step 1302). The first switch-module controller 1002(0) translates the connection request 
into a time-slot-allocation request that is subsequently sent to the master controller 1102 (step 1303). Eventually, the 
master controller 1102 sends configuration instructions (a connection schedule) to the first switch-module controller 

35 1002(0) that takes into account the lime-slot-allocation request (step 1304). Where the time-stot-allocation request 
was, at least partially, satisfied by a second order connection to the second switch module 1004(1), a connection 
schedule that takes Into account the time-slot-allocation request is also sent to the second switch-module controller 
1002(1) (step 1304). In step 1305 both switch-module controllers 1002, send updated configuration information to 
respective switch modules 1 004. The first switch-module controller 1 002(0) also sends an updated connection schedule 

40 to the switch module 1004(0) (step 1306) that is destined for the first electronic edge node 102Z (step 1307). 

[0145] When the time comes to use the connection schedule, the first electronic edge node 102Z sends data blocks 
to the first switch module 1004(0) (step 1308) arranged in time slots according to the connection schedule. Those time 
slots allocated to the connection specified in the connection request are then switched at the first switch module 1004 
(0) and sent to the second switch module 1004(1) (step 1309). The second switch module 1004(1) has been appro- 

45 priately configured (in step 1305) to direct those time slots allocated to the connection to the second electronic edge 
node 102U. 

[0146] When a switch module receives a connection request (step 1402 in FIG. 14), it associates an inlet port and 
an outlet port with the connection request (step 1404) and identifies the switch modules to which the inlet and outlet 
ports belong (step 1 405). Each switch-module controller maintains connectivl^y data of the entire modular switch (mod- 
so ular core node) and can, therefore, identify the switch modules associated with the inlet and outfet ports. It is noted 
that an inlet port specified in a connection request may not belong to the switch-module that receives the connection 
request. The switch-module controller also determines a number of time slots to be allocated and communicates a 
time-slot-allocation request to the master controller (step 1408). 

[0147] When a switch-module controller receives a connection request indicating an inlet port J and an outlet port 
55 K, and a number of time slots, it may formulate a time-slot-allocation requests that specifies a lesser number of time 
slots. This could happen if the switch-module controller receives a request to release a current connection-between 
the same inlet port J and outlet port K, or reduce the number of time slots allocated to a current connection. A lesser 
number of time slots may also be specified in the time-slot-allocation request under certain overload conditions, in 
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accordance with a servrce-level agreement. 

[0148] FIG. 15 illustrates the application of second-order time-slot matching process, where free time slots must be 
aligned in two stages of space switches, using a data structure based on the mesh stmcture of FIG. 1 2. In this example, 
each switch module 1004 has three inlet ports 1016 and three outlet ports 1018 interfacing with electronic edge nodes 

5 102. Further, each switch module 1004 has four inbound ports 1006 and four outbound ports 1008 connecting to other 
switch modules 1004. A state map associated with each switch module 1004 has 1 6 rows, each row corresponding to 
a time slot in a 16-slot calendar, and ten columns. The ten columns include a column corresponding to each of three 
inlet ports 1016, a column con-esponding to each of three outlet ports 1018 and a column corresponding to each of 
four outbound ports 1008. To have columns corresponding to the four inbound ports 1006 would be redundant as the 

10 same infonmation is conveyed by the columns corresponding to the outbound ports 1008 of other switch modules that 
connect to the four inbound ports 1006. 

[0149] The term channel-band when used herein, may refer to a single channel, a subset of channels or an entire 
WDM link. Each port handles a channel-band. Each entry in FIG. 1 5 corresponds to a time slot in a channel-band, and 
therefore a time slot in a calendar associated with a port. There is a column for each upstream channel-band from an 

IS electronic edge node 102 to an inlet port 1016, a column for each downstream channel-band from an outlet port 1018 
to an electronic edge node 102 and a column for each inter-modular link from an outbound port 1008 of a switch module 
1004 to an inbound port 1006 of another switch module 1 004. A channel-band to an inbound port 1006 in a designated 
switch module 1004 also connects an outbound port 1008 of another switch module 1004 to the designated switch 
module 1004 and, hence, the state of each of the inbound ports 1006 may be determined accordingly. A single bit may 

20 identify the state of an inter-modular link during a specific time slot, the state being simply "free" or "busy". An inter- 
modular link can be carrying a single wavelength channel or several channels. In FIG. 15, a time slot at an inlet fort 
1016 (column 1502) in the first switch module 1004(0) to an outlet port 1018 (column 1506) in the second switch module 
1004(1) is selected to use a single inter-modular link (column 1504). The path selection requires a time-slot matching 
process that requires the alignment of free corresponding time slots in the columns 1502,1504 and 1506. 

25 [0150] FIG. 16 uses the data structure of FIG. 1 5 to illustrate second-order and third-order matching processes for 
the mesh structure of FIG. 12. in FIG. 16, a path from an inlet port 1016 (column 1602) in the first switch module 1004 
(0) to an outlet port 1018 (column 1618) In the fifth switch module 1004(4) can be selected to use free corresponding 
time slots in column subsets represented by column reference numbers: {1602-1610-1618}, {1602-1604-1612-1618}, 
{1602-1606-1614-1618} or {1602-1608-1616-1618}. Notably, the first subset, {1602-1610-1618}, includes three col- 

30 umns, and each of the other subsets includes four columns. 

[0151] Compound matching is facilitated by inducing strong occupancy congelation among all ports In the modular, 
high-capacity, optical core node 1100 with a high link-occupancy gradient. This is realized by a path selection discipline 
that preferably applies to all possible Internal paths from an inlet port to an outlet port. 

[0152] FIGS. 17A and 17B illustrate occupancy of two inter-modular links, labeled LINK-10 and UNK-4. In FIG. 17A, 
35 the time-slot occupancies of UNK-10 are shown in a UN K- 1 0 occupancy plot 1701 A white the time-slot occupancies 
of LINK-4 are shown in a LINK-4 occupancy plot 1702A. In FIG. 17A, the occupancies of UNK-10 and UNK-4 appear 
to be weakly correlated. In FIG. 17B, the time-slot occupancies of LINK-13 are shown in a UNK-13 occupancy plot 
1701 B while the time-slot occupancies of LINK-19 are shown in a LINK-19 occupancy plot 1702B. In FIG. 178, the 
occupancies of UNK-13 and UNK-19 appear to be highly con^elated. Such a highly-conrelated pattern assists to in- 
40 crease the probability of successful time-slot matching. 

[0153] FIG. 18A illustrates occupancy of two inter-modular links, labeled UNK-1 1 and LINK-9. In FIG. 18A, the time- 
slot occupancies of LINK-11 are shown in a UNK-1 1 occupancy plot 1801 A while the time-slot occupancies of LINK- 
9 are shown in a LINK-9 occupancy plot 1802A. In FIG. 18A, the occupancies of LlNK-11 and LINK-9 appear to be 
highly correlated and each exhtoit a moderate occupancy gradient. In FIG. 18B, the time-slot occupancies of LINK-20 
45 are shown in a LlNK-20 occupancy plot 1801 B while the time-slot occupancies of LINK-2 are shown in a LINK-2 
occupancy plot 1802B. In FIG. 18B, the occupancies of LINK-20 and LINK-2 appear to be highly correlated and exhibit 
a high occupancy gradient. A higher time-slot matching probability is realized with high occupancy correlation and high 
occupancy gradient. 

so Port numbering scheme 

[0154] Inlet and outlet ports are preferably identified according to the switching modules and port positions within 
the switch modules. Within a switching module, an inlet or an outlet port is given a number between 0 and a specified 
maximum. Preferably, the specified maximum port number is unified for all switching modules, even if the actual number 
ss of ports provisioned in a given module is substantially less than the maximum. Thus, if the specified maximum number 
of ports is P, an inlet-port identifier takes the form j.m, 0 < j < P and an output-port identifier takes the form k.n, 0 < k 
< P, j being the relative input-port position within switch module m and k the relative port position within switch module 
n. An inlet port j.m.and an outlet port k.n belong to the same switch module only if n = m. An internal link, whether 
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outbound or inbound, is also Identified as a concatenation of an identifier, m, of a first switching module fronn which 
the lini< originates and an identifier, n, of a second switch module on which the link temiinates. Thus, an internal link 
would have an identifier m.n, 0 < m < Q, and 0 < n < Q , Q being the maximum number of switching modules in the 
modular core node. Selecting the maximum number of inlet ports to be 256, the maximum number of outlet ports to 
5 be 256, and the maximum number of switching modules to be also 256, for example, an inlet port or an outlet port can 
be conveniently represented by two bytes (octets). 

[0155] FIGS. 19A and 198 Illustrate a link labeling scheme for the inter-moduiar links and, by extension, for the 
inbound ports 1006 and the outbound ports 1008. The inter-modular links outbound from a switch module 1004(m), 

0< m < N-1 N-1 being the number of switch modules 1004, are labeled from mxP to (m+1)xP-1 , where P is an upper 
10 bound of the number of inbound ports 1006 outbound ports 1008, inlet ports 1016 or outlet ports 1018 for any switch 
module 1004. For the exemplary network of FiG. 19A, P = 5 and for the third switch module 1004(2) (m = 2) the 
outbound inter-modular links are labeled from 2x5 to (3) x5-1 , that is, from 1 0 to 14. For ease of labeling, a non-existing 
link from each switch module to itself is assigned a label. This link labeling scheme is applied even if some or all of the 
switch modules 1004 employ fewer than P inbound ports 1006. Thus, the outbound ports 1016 of each module are 
?5 labeled with consecutive numbers while the inbound ports 1006 of consecutive modules are labeled with numbers that 
are successively separated by P. 

[0156] In the structure illustrated in FIGS. 19A and 19B, the two sides of each switch module 1004 have the same 
number (P) of Inbound ports 1006 and outbound ports 1008. As will be apparent to a person skilled in the art. it is 
possible, and even likely, that a modular, high -capacity, optical core node 1100 may be constructed with a number of 

20 switch modules 1004 that is not equal to the number of inbound ports 1006 on each switch module 1004. 

[0157] FIG. 20A illustrates a port numbering scheme based on a first matrix 2000A with the horizontal axis of the 
first matrix 2Q00A indexing the inlet ports 1016 and the vertical axis indexing the switch modules 1 004. The first matrix 
2000 A sets out a structure for a sub-section of an inlet port-state memory 2411 that will be discussed in further detail 
hereinafter with reference to FIG. 24. Each sub-section of the inlet port-state memory 2411 is used to maintain an 

25 indication of the state (i.e., busy or free) of each inlet port 1016 for a single time slot, A second matrix 201 OA is illustrated 
in FIG. 20A that, for a particular time slot, indicates the state of each inlet port 1016 and, therefore, may be called an 
inlet port-state map. A "0" in the second matrix 201 OA indicates a free state for the inlet port 1 01 6 to which the location 
in the second matrix 2010A relates, while a "1" In the second matrix 2010A Indicates a busy state. Note that the state 
represented in the second matrix 201 OA is for a particular time slot in the calendar associated with each of the labeled 

30 inlet ports 1016. 

[0158] FIG. 20B illustrates a port numbering scheme based on a first matrix 2000B with the horizontal axis of the 
first matrix 20008 indexing the outlet ports 1018 and the vertical axis indexing the switch modules 1004. The first matrix 
2000B sets out a structure for a sub-section of an outlet port-state memory 2412 that will be discussed In further detail 
hereinafter. Each sub-section of the outlet port-state memory 2412 is used to maintain an indication of the state of 
35 each outlet port 1018 for a single time slot. A second matrix 201 OB is illustrated in FIG. 20B that, for a particular time 
slot, indicates the state of each outlet port 1018 and, therefore, may be called an outlet port-state map. For the meaning 
of a "0" or a "1" in the second matrix 20108 see the discussion of FIG. 20A hereinbefore. 

[0159] For uniformity, as mentioned above, the Inlet ports 1016 in a switch module m are numbered asmxP to (m+1) 
X P-1 and the outlet ports 1 01 8 are numbered likewise, P being the specified maximum number of ports. Consequently, 
40 . even though the switch modules 1004 under consideration have only three inlet ports 1 016 and three outlet ports 1 01 8, 
these ports are labeled as if there were five of each. 

[0160] In view of FIG. 20A and FIG. 20B, a time-slot-allocation request identifying the inlet port 1016 labeled 12 and 
the outlet port 1018 labeled 11 may be satisfied, for the particular time slot with which the second matrices 2010A, 
2010B are associated, within the third switch module 1004(2), requiring only a first-order matching process. Note that 
45 In the second matrix 2010A of FIG. 20A the Inlet port 1016 labeled 12 is free and in the second matrix 2010B of FIG. 
20B the outlet port 1018 labeled 11 Is free. 

[0161] A time-slot-allocation request Identifying the inlet port 1016 labeled 12, of the third switch module 1004(2), 
and the outlet port 1018 labeled 20, of the fifth switch module 1004(4), requires a compound matching process. The 
first step of such a compound matching process is determining, for a particular time slot, that the inlet and outlet ports 

50 1016, 1018 of interest are free. 

[0162] FIGS. 21 A and 21 B assist in the compound matching process. FIG. 21 A illustrates a port numbering scheme 
based on a first matrix 2100A with the horizontal axis of the first matrix 2100A indexing the outbound ports 1008 and 
the vertical axis indexing the switch modules 1004. By labeling the outbound ports 1008, the inter-modular links have 
also, In effect, been labeled. The first matrix 21 00 A sets out a structure for a sub-section of an outbound port-stag 

55 memory 2413 that will be discussed In further detail hereinafter Each sub-section of the outbound port-state memory 
2413 is used to maintain an indk^atlon of the state (i.e., busy or free) of each outbound port 1008 for a single time slot. 
A second matrix 2110A is illustrated in FIG. 21 A that, for a particular time slot, indicates the state of each outbound 
ports 1008 and, therefore, may be called an outbound port-state map. By extension, the state of each Inbound port 
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1006 and each inter-modular link is also indicated, for a particular time slot, by the second matrix 2110A. Notably, the 
state of the outbound port 1008 of the switch module 1004 bearing the same Index is always "1" (i.e., each element in 
a diagonal in matrix 2110A always has a state of "1 "). 

[0163] The second step of the compound matching process discussed in conjunction with FIGS. 20A and 20B is 
5 detennining that, for the particulartime slot, an outbound port 1008 of the origin switch module 1004 and acorrespond- 
ing inbound port 1006 of the destination switch module 1004 are free. In the case of a direct inter-modular link between 
the switch modules 1004, the stale of only one inner port 1006,1008 need be assessed. However, in the case of a path 
from an origin switch module 1004 to a destination switch module 1004 passing through an intennediate switch module 
1004, there is a need for the stale of two outbound ports 1008 to be assessed (namely an outbound port 1008 on the 
10 origin switch module 1004 and an outbound port 1008 on the intermediate switch module 1004), This is equivalent to 
assessing the state of one outbound port 1008 and one inbound port 1006 (namely an outbound port 1008 on the origin 
switch module 1004 and an inbound port 1006 on the destination switch module 1004). 

[0164] This latter assessment may be accomplished for a particular time slot by comparing the row, of the second 
matrix 211 OA of FIG. 21 A, that conresponds to the origin switch module 1004 with the column, of the same second 

IS matrix 21 1 0A, that corresponds to the destination switch module 1 004. In the outbound port-state memory that main- 
tains a copy of the second matrix 211 OA for each time slot, the comparing requires that the memory locations of the 
row be read. Such a row read operation can be accomplished with a single memory access where the state information 
is maintained in contiguous memory locations, as is preferred. The comparing also requires that the column be read. 
Such a column read operation would take five memory-access steps for the exemplary second matrix 2110A. 

20 [01 65] To facilitate the third-order matching process, a further state memory may be created. Nominally, this further 
state memory may be an Inbound port-state memory, but the structure of each sub-section, as represented by a first 
matrix 2100B of FIG. 21 B, is a transpose of the first matrix 2100A of FIG. 21 A. This transposition of the first matrix 
21 00 A of FIG. 21 A allows the column read operation mentioned above to be accomplished as a row read operation, 
requiring just a single read command, thereby requiring less processing time. Accordingly, a second matrix 211 OB is 

25 illustrated in FIG. 21 B that, for a particular time slot, indicates the state of each inbound ports 1006 and, therefore, 
may be called an inbound port-^te map. The second matrix 21 10B of FiG, 21 B is a transpose of the second matrix 
211 OA of FIG. 21 A. 

[0166] An indicated row2102Ain FIG. 21 A contains the identities of outbound ports 1008 of the third switch module 
1004(2) while an indicated row 21048 in FIG. 21 B contains the identities of inbound ports 1006 of the Hfth switch 
30 module 1004(4). Note that the identities in the indicated row 2104B of FIG. 21 B are the same as the identities in an 
indicated column 2104A of FIG. 21 A. 

[0167] To select a path, as requested above, from the inlet port 1016 labeled 1 2 of the third switch module 1004(2) 
to the outlet port 1018 labeled 20 of the fifth switch module 1004(4), a compound matching process is required. That 
is, the same time slot must be found vacant not only at the inlet port 1016 labeled 12 and the outlet port 1018 labeled 

35 20 but also on any inter-modular links in a path from the third switch module 1004(2) to the fifth switch module 1004(4). 
[0168] Using the identification of ports presented in FIGS. 19A and 19B, an outbound port 1008 and a corresponding 
inbound port 1006 can be represented by a single label (number). This single label is the label for the inter-modular 
link that connects the con^esponding ports. For example, compare the indicated row 2104A in FIG. 21A, containing 
the identities of outbound ports 1008 of the third switch module 1004(2). to the labels on the inter-modular links ema- 

40 nating from the third switch module 1004(2) in FIG. 19A. Note that diagonal entries of the first matrix 2100A of FIG. 
21 A and of the first matrix 2100B of FIG. 21 B are italicized to indicate that the diagonal entries are null entries, i.e., 
the diagonal entries do not correspond to actual inter-modular links. 

[0169] The number of pairs of inter-modular links is one less than the number outbound ports 1008 for the origin 
switch module 1004 under consideration (i.e.. P - 1). To select a path from the third switch module 1004(2) to the fifth 

45 switch module 1004(4), a path finding process should consider the state of the following inter-modular links: {10, 4}, 
{11, 9}, (13, 19), {14, Null}. Notably, these pairs may be determined by aligning the indicated row2104Aof FIG. 21 A 
with the indicated row 2104B of FIG. 21 B. The latter of these pairs is representative of a single inter-modular link from 
the third switch module 1004(2) to the fifth switch module 1004(4), requiring only a second-order matching process. 
Recalling that a first-order matching process is necessary above to match a vacant tim.e slot in an inlet port 1016 to a 

50 vacant lime slot in an outlet port 1018 of the same switch module 1004, the second order matching process adds the 
additional requirement of matching a vacant time slot in an outbound port 1008. The rest of the pairs of inter-modular 
links require a third-order matching process. The third-order matching process adds the requirement of matching a 
vacant time slot in an outbound port 1008 of an intermediate switch module 1004 (or a vacant time slot in an inbound 
port 1008 of the destination switch module 1004). 

55 [0170] For the example time slot represented by the second matrix 2010A of FIG. 20A, the second matrix 2010B of 
FIG. 20B, the second matrix 2110A of FIG. 21 A and the second matrix 2110B of FIG. 21 B, the compound matching 
process required to select a path from the inlet port 1016 labeled 12 of the third switch module 1004(2) to the outlet 
port 1018 labeled 20 of the fifth switch module 1004(4) begins by establishing that the inlet port 1016 labeled 12 and 
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the outlet port 1018 labeled 20 are free. Once inlet and outlet ports have been established to be free, the inter-modular 
link from the third switch module 1004(2) to the fifth switch module 1004(4) is considered.lf that inter-modular link, 
labeled 14 in FIG. 19A and corresponding to the outbound pork labeled 14 in FIG. 21 A, is free, then the second-order 
matching process is complete and the result {14, Null) is output. If that inter-modular link is busy, as it Is in the example 
state map given by the second matrix 2110A of FIG. 21 A, then the second-order matching process is attempted for 
the next time slot. If a second-order matching process has been attempted, and failed, for every time slot, a third order 
matching process may be aUempted, again on a time slot by time slot basis. 

[0171] Where a third-order matching process is attempted on the time slot represented in FIGS. 21 A and 21 B, the 
states of the inter-modular links in the pairs {10, 4}, {11 , 9}, (13,19) are considered. Although the outbound port 1008 
labeled 1 0 at the third switch module 1004(2) is free, the outbound port 1008 labeled 4 at the first switch module 1004 
(0) Is busy. Additionally, the outbound port 1 008 labeled 1 1 at the third switch module 1 004(2) is busy and the outbound 
port 1008 labeled 9 at the second switch module 1004(1) is free. Fortunately, the outbound port 1008 labeled 13 at 
the third switch module 1004(2) is free and so is the outbound port 1008 labeled 19 at the fourth switch module 1004 
(3). The third-order matching process is then considered to be complete and the result {13, 19) is output. 
[0172] Alternative labels for ports are also contemplated. Such alternative labels represent a port with two binary 
values separated by a period. The first value Is the index, m, of the switch module 1004(m) and the second number 
is a relative port number. For the above example, the inlet port 1016 labeled 12 may be alternatively labeled 01 0.01 0, 
i.e., the switch module 1004(2) number2 andthe relative inlet port 1016 number 2. Equally the outlet port 1018 labeled 
20 may be alternatively labeled 100.000, i.e., the switch module 1004(4) number 4 and the relative outlet port 1018 
number 0. 

[0173] As described earlier, in a first-order matching process, occupancy packing increases the throughput of a 
space switch. Occupancy packing in a time-multiplexed space switch increases the temporal occupancy gradient of 
the input and output ports of the space switch, which tends to maximize a probability of vacant time-slot alignment. 
The probability of free time-slot alignment decreases as the order of the matching process increases. In the mesh 
structure 1200 (see FIG. 12) of the modular, high -capacity, optical core node 1100 in accordance with the present 
invention; the relatively low probability. of time-slot alignment along a single internal path (made up of one or more inter- 
modular link) within the mesh structure 1200 is offset by the availability of multiple internal paths. This, however may 
require intensive processing. The pipelined processing method according to the present invention enables a high rate 
of connection routing within the mesh structure 1200 as will be described below with reference to FIG. 23. 
[0174] Switch-module controllers receive release requests from edge nodes and communicate the requests to the 
master controller In an adaptive network, an edge node may modify its path-capacity requirements to follow traffic- 
load variation. Thus, an edge node may request more time slots per time frame or may offer to release a number of 
time slots in a current connection to another edge node. The master controller need not be aware of the individual 
connections managed by the switch-module controllers. Thus, each switch-module controller formulates a release 
request indicating the inlet port, the outlet port, and any traversed internal link. 

[0175] It is preferable that release requests be given the highest priority for two reasons. The first is that the release 
requests require a negligible processing effort. The second is that releasing resources as soon as possible facilitates 
the scheduling process. 

[0176] FIG. 22 partly illustrates the connection-control circuit 1104. The switch-module controllers periodically send 
parameters of connection requests to the master controller The requests are received at consecutive intervals of time 
and selector 2210 directs the release requests and connection requests to a request memory 2212 from which the 
requests are placed in buffers 2220, 2222, 2224. or 2226 according to the request type as determined by request sorter 
2214. An additional buffer 2228 receives continuation requests from a "tail" scheduler (scheduler number H) as will be 
described below. 

[0177] The core node may be operated in a TDM mode, as described earlier where each TDM frame is divided into 
H > 1 sub-frames. A cascade of H > 1 schedulers is provided and the h*^ scheduler 1 < h < H, covers 'V^> 1 time slots 
per TDM frame. The V^, time slots need not be consecutive. It is convenient, however to allocate consecutive locations 
In a calendar, so that the h^^ scheduler is allocated time slots to + H'^ -1 : with x^ = 0. The actual time slots can be 
a one-to-one mapping of the consecutive time slots. This data is virtually static and is set when the scheduling mech- 
anism is installed or modified. Each switch-module controller must be aware of the values x^, and 4^^ for each of the H 
schedulers, and the corresponding mapping. For example, if the total number S of time slots per TDM frame (per 
calendar) is a power of 2 (such as 1 ,024), a reverse-binary mapping can be used to distribute the time slots allocated 
by a scheduler along the entire TDM frame. Reverse binary mapping is derived by reading consecutive numbers in 
the reverse binary order, i.e., the least-significant bit becomes the most significant bit, and vice-versa. 
[0178] As illustrated in FIG. 22, there are five buffers, 2220, 2222, 2224, 2226, and 2228, at the input of the head 
scheduler (scheduler 2312-1 in FIG. 23). The first buffer 2220 contains release requests. The second buffer 2222 
contains new requests where the inlet port and outlet port belong to the same switch module. The third buffer 2224 
contains new requests where the intet port and outlet port are on different switch modules and a direct internal link 
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from the originating switch module and terminating switch module is provided. If a direct link is not provided, the request 
is placed in the fourth input buffer 2226. With a fully-meshed switch modules, a direct internal link is provided for each 
switch-module pair and the fourth input buffer 2226 is not needed. The fifth buffer 2228 contains only requests, initially 
placed in the third buffer 2224, that were processed by the cascade of schedulers according to a second-order matching 

5 process but the second-order matching process failed to find the required number of allocable time slots. A type of 0 
is associated with each release request placed in the first input buffer 2220, a type of 1 is associated with each con- 
nection request placed in the second or third input buffers, 2222, 2224, and a type of 2 is associated with each con- 
nection request placed in the fourth or fifth input buffer, 2226, 2228. Each of the H schedulers distinguishes a type-1 
connection request having parameters j.m.. and k.n by comparing m and n. When m = n, the connection requires only 

10 a first-order matching process. The purpose of using separate buffers (2222 and 2224) for type-1 requests-is to enable 
a service-priority discipline for ail the five buffers. It is noted that the first four buffers 2220, 2222, 2224, 2226, can 
share a memory device because only one is accessed at any time. The fifth buffer 2228 is preferably a separate memory 
device because it is accessed independently. 

[0179] A 1 :2 distributor 2304 and a 1 :H distributor 2308 (see FIG. 23) are components of the path -selection device 
IS 1106. The connection-control circuit 1104 controls the dequeuing of the release requests waiting in buffer 2220. Pref- 
erably, release requests are given top priority in order to release reserved time slot as soon as the reserved time slots 
become Idle. 

[01 80] The connection-control circuit 1 1 04 also controls the dequeuing of the connection-request buffers 2222, 2224, 
2226, and 2228, and the head scheduler processes one connection-request at a time. In a first preferred dequeuing 

20 method, the buffers are dequeued according to a predetennined priority order. In a second method, the buffers are 
dequeued in a cyclic order, with each of the four buffers allocated a time interval during which some or all of Its waiting 
requests are processed by the head scheduler. When a connection-request buffer is empty, the connection-control 
circuit 1104 proceeds to dequeue another connection-request buffer. If the first method is used, the preferred priority 
order is: 2224, 2228, 2226, then 2222. The rationale behind this selection is that giving inter-module connections 

25 waiting in buffer 2224 a high priority increases the proportion of connections using a single link 420. A connection 
request waiting In buffer 2228 may have already reserved time slots along a direct link which would remain idle until 
the connection is completely scheduled. Buffer 2226 Is only used when a direct link 420 is not provided, and a con- 
nection-request waiting in the buffer is not holding any time slots. Finally, a connection request waiting in buffer 2222 
has a high probability of acceptance and may then be given the lowest priority. 

30 [01 81 ] In the second method, the connection -control circuit 1 1 04 allocates a time interval for dequeuing each buffer. 
When the interval allocated to de-queue a buffer expires, or when the buffer becomes empty, a selector 2230 proceeds 
to the next buffer in a cyclic fashion so that the third buffer 2224 follows the second buffer 2222 and the second buffer 
2222 follows the fifth buffer 2228. Note that the release-request buffer 2220 is always given the highest priority and its 
dequeuing interieaves the dequeuing of the four connection-request buffers. For example, each of the intervals may 

35 be selected to be one millisecond, If the spatial distribution of the traffic is uniform, a small proportion of connections 
would be confined within switching modules, i.e., requiring a first-order matching process, and the majority of connec- 
tions would be established over direct internal links connecting switching modules. Thus, even though the four con- 
nection-request buffers are granted equal service Intervals, the head scheduler receives most of its requests from the 
third buffer 2224, and the other four intervals would be shortened below the assigned one millisecond. With non-uniform 

40 spatial distribution of traffic, where the connection requests received at a switching module are directed to a few of the 
(N-1 ) other switching modules, a significant proportion of the connections may have to use two-link paths, each req uiring 
a third-order matching process, and the fourth allocated interval would be the busiest of the assigned four intervals. 
[0182] Global time-slot occupancy packing, more fully described hereinafter, could be a time-ccnsuming process, 
which, in some instances, may be too slow to be practically useful if implemented according to traditional path selection 

45 methods. However, the use of cascaded schedulers 2312, to be described with reference to FIG. 23, in the master 
controller 1102 enables path selection within the modular high-capacity, optical core node 1100 at very high rates. A 
single scheduler 2312 may be limited to paih selection for a predetennined number of time-slots in a calendar. Each 
scheduler 2312 in this cascaded (pipelined) structure uses memory devices 2408 to hold a state map indicating the 
busy/idle state for each inlet port 1016 and each outlet port 1018 for the predetermined number of time slots, i.e., the 

50 time-slot band handled by a stage. Advantageously, each entry in each state map need only be one-bit wide {see the 
discussion of FIGS. 21 A, 21 B). If a very high path selection rate is not required, the number of cascaded schedulers 
2312 can be reduced. Such a reduction requires that each scheduler 2312 handle more time-slots per calendar. 
[01 83] A strategy of global time-slot occupancy packing is used herein to promote order in the path selection process, 
wherein each time-slot-allocation request is processed in the same order in the path selection device 1106. That is, a 

55 path to satisfy each time-slot-allocation request is sought starting with the time-slot band A, corresponding to the first 
scheduler 2312A of the path selection device 1106 (FIG. 23). If. a path is not found for all the requested time-slots, an 
amended version of the time-slot-allocation request may be passed to the second scheduler 231 2B, and so on. By 
always processing the time-stot-allocatlon requests in this order, the early time slots will tend to have high occupancy 
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and the later time slots will tend to have low occupancy. This occupancy pattern is known to resuft In a more efficient 

matching process. 

[0184] FIG. 23 illustrates elements of the path selection device 1106 of the master controller 1102 of FIG. 11. The 
path-seiection device 1 1 06 receives both connection requests and release requests from the selector 2230. The search 

5 effort for finding an internal path can be divided among a number of schedulers 2312, where each scheduler 2312 
handles the search over a prescribed "band" of time slots. The number. S, of time slots per calendar is preferably large, 
of the order of 1 ,024 for example. The S time slots may be grouped into J sets, where J < S, and each of the J sets 
may be associated with a processing unit. Each of the schedulers 231 2 is a time-slot matching device that can perform 
first-order, second-order, or third-order matching processes. 

10 [0185] A 1 :2 distributor 2304 directs connection requests to a head scheduler 2312-1 and release requests to a 1: 
H distributor 2308 which places each release request in a release-request buffer associated with each scheduler 231 2. 
The appropriate schedulers 2312 for each release request are determined by request sorter 221 4. Each switch-module 
controller associates each release request received from edge nodes 102 with one or more of the H schedulers 2312 
and where a release request covers more than one scheduler 231 2, the release request is divided into release requests 

15 each covering only one scheduler, it is noted that, in order to realize temporal packing, each connection requests must 
be offered to the head scheduler and proceed through the an^ay of schedulers 2312 until the number of time slots 
specified in the connection request is allocated or all internal routes have been unsuccessfully attempted. In contrast, 
a release request specifies the time slots to be released and the corresponding inlet port, outlet port, and inner links. 
Therefore, it would be wasteful to let a release request proceed along the an-ay of schedulers 2312. The path selection 

20 device 1106 makes use of cascaded stages to expedite a, path selection process. The number of time slots in a 
scheduling period may be arranged as time-slot bands (of, for example, eight time-slots per band) so that each stage 
of the path selection device 1106 may correspond to a single time-slot band. The cascaded schedulers of FIG. 23 are 
labeled 1 through H. Each of a set of schedulers 2312-1, 2312-2, 2312-(H-1), 2312H communicates with a corre- 
sponding one of a set of result buffers 2314-1, 2314-2, 2314-(H-1), 2314H (refen-ed to individually or collectively 

25 as 231 4). The output of the result buffers 231 4 is transmitted to a cyclic selector 2316 from which selected results are 
transmitted to switch-module controllers 1002. Note that, at any instant of time, it is possible that all the H schedulers 
2312 be engaged in processing connection requests. The processing gain, hence the increased scheduling throughput, 
derives directly from this simultaneous processing. The processing gain of a system employing multiple processing 
units may be defined as the ratio of the mean processing throughput to the throughput of a system employing a single 

30 processing unit. 

[0186] The cyclic selector 231 6 transfers the results of all schedulers to the connection-control circuit 1 104 (see FIG. 
11), A path-search attempt perfomned by the path selection device 1 106 may terminate successfully at any scheduler 
2312 of the path selection device 1106. Only unsatisfied time-slot-allocation requests are processed in alt schedulers 
2312. Notably, while the time -slot-allocation requests arrive sequentially at the path selection device 1106, successive 

55 time-slot-allocation requests may temiinate concurrently at different schedulers 231 2. Each scheduler 231 2 is therefore 
provided with a result buffer 2314 to store each result record that is a product of satisfaction, at least in part, of a time- 
slot-allocation request. Alternatively, the result buffer 2314 may store an identity (say, a cyclical request number) that 
points to a result record, where the record stores attributes of the path selected to satisfy the time-slot-allocation request. 
The cyclical selector 231 6 visits the result buffers 2314 and, under control of a de-queue circuit (not illustrated), reads 

40 the content, if any of each result buffer231 4 (or of the result record to which the identity in the result buffer 231 4 points) 
and transfers the content to the connection -control circuit 1 104 in the master controller 1102 (FIG. 11). With proper 
selection of the capacities of the inner ports 1006,1008 and the inlet and outlet pots 1016,1618 of each switch module 
1004, the incidence of unsuccessful path-search attempts can be made negligibly small. 

[0187] As stated hereinbefore, the time slots of the calendar of interest may be collected into time-slot bands to 
45 enhance processing efficiency Each stage of the path selection device 1 106, including a scheduler 2312 and a result 
buffer 231 4, corresponds to a single time-slot band. A time-slot band may include only one time slot. However, when 
the number of time slots per TDM frame (per calendar) is large, 1 ,024 for example, a time-slot band may contain several 
time slots. There is no requirement that each of the time-slot bands considered by a path selection device 1106 have 
the same number of time slots. In fact, there may be significant advantages realized by arranging the time-slot bands 
50 SO that the earlier time-slots bands contain a small number of time slots, while the later time-slots bands contain a 
larger number of time slots. 

[0188] If, for example, time-slot-allocation requests are received at a rate of one million per second in a 20-band 
path selection device 1106. and with proper selection of the time-slot bands, the result records would be queued in 
the result buffers 231 4 at approximately the same rate, with a negligible request rejection rate. A worst-case de-queue 
55 delay may occur when all the time-slot-allocation requests terminate successfully at a single given time-slot band. The 
cyclic selector, which is unaware of the content of individual result buffers 2314, must visit all the result buffers 2314. 
With an access time of 20 nanoseconds, for example, there may be an overhead time of about 0.4 microseconds (20 
buffers and 20 nanoseconds access time) and the available de-queue time per time-slot-allocation request may be 
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about 0.6 microseconds. If the number of schedulers is large, 100 for example, an auxiliary cydic selector (not illus- 
trated) may be used to concurrently "look ahead" and list, in an auxiliary buffer (not illustrated), only those result buffers 
2314 that contain results. In any case, the de-queue delay may be considered insignificant because a high overhead 
results in more accumulation and a self-adjusting relative reduction in overhead. A scheduler 2312 can only process 
5 one request at a time. Thus, it dequeues a request from a request buffer only after It either places a record in its result 
buffer and/or after it places the request parameters into a cascade buffer, i.e., the connection-request buffer of a sub- 
sequent scheduler, for further processing. 

[0189] It is noted that a direct link connecting a first switch module to a second switch module may be reserved for 
the exclusive use of connections from inlet ports of the first switch module to outlet ports of the second switch module 
10 if the total rate of such connections is known to be sufficiently high. Under high load, connection requests from the first 
switch module to the second switch module may still be accommodated using a two-link path through an intemriedlate 
switch module. 

[0190] FIG. 24 Illustrates one of the cascaded stages of FIG. 23, where a stage includes a scheduler 2312 and a 
result buffer 2314. A connection -request buffer 2402, as part of the scheduler 2312, receives incoming time-slot-allo- 

is cation requests either from the connection-control circuit 1104 (for the head scheduler 2312-1) or from the previous 
scheduler (for the scheduler 231 2-2 through to the tail scheduler 231 2-H) and buffers these time-stot-allocation requests 
before transmitting the time-slot-aHocation requests to a path finder 2404. A release-request buffer 2410 receives 
requests to release paths allocated for time slots covered by the specific scheduler 2312, The path finder 2404 gives 
the release- request buffer priority over the request buffer 2402 in order to exploit the released time slots in accommo- 

20 dating any new connection requests in connection-request buffer 2402. 

[0191] As discussed hereinbefore, a time-slot-allocation request may, for instance, identify an inlet port 1016, an 
outlet port 1 01 8 and a number of time slots required per scheduling period. The path finder 2404 has access to memory 
devices 2408 for maintaining an inlet port-state memory 2411. an outlet port-state memory 2412, an outbound port- 
state memory 2413 and an inbound port-state memory 2414 that relate to the time-slot band assigned to the scheduler. 

25 A selector 2406 receives a result of a path finding process carried out by the path finder 2404. If the process has been 
successful, i.e., a path has been found for at least one of the time slots requested, a result record that indicates the 
found path or paths is sent to the result buffer 231 4. The result record is then selected by the cydic selector 2316 (FIG. 
23). Where the process has been unsuccessful, at least in part, i.e., a path has been found for fewer than ail of the 
time slots requested, the time-slot-ailocation request (amended to reflect the selection of a path for some, though not 

30 all, of the requested time slots) may be sent to the next scheduler 231 2 (forthe head scheduler 2312-1 and the scheduler 
2312-2 through to the scheduler 2312-(H-1)) or to the connection-control circuit 1104 (for the tail scheduler 231 2-H). 
[01 92] Consider that the time slots of a given calendar may be collected into time-slot bands, as described above in 
conjunction with FIG. 23, where the time-slot bands do not necessarily have equal numbers of time slots and that a 
given time-slot-allocation request may indicate a need for more than one time slot in the calendar. Each scheduler 

35 2312 of the path selection device 11<W may be associated with four memory devices 2408 that cover a particular time- 
slot band. These four memory devices may include, for instance, an inlet port-state memory 2411 (FIG. 20A), an outlet 
port-state memory 241 2 (FIG. 20B), an outbound port-state memory 241 3 (FIG. 21 A) and an inbound port-state memory 
2414(FIG. 21 B). 

[0193] As noted earlier, the number of time slots in each time-slot band is arbitrary and may vary from one time-slot 
40 band to another. The mean search time in a time-slot band Increases with the number of time slots in the time-slot 
band. Increasing the number of time-slot bands in a calendar increases the path selection capacity of the path selection 
device 1106. Such an Increase in the number of time-slot bands reduces the size of individual memory devices 2408 
but increases the required numberof memory devices 2408. 

[0194] Configuration of the path selection device 1106 may be under the control of the connection control circuit 
45 1104, which may monitor the mean search time. Based on changes in the mean search time, the connection control 

circuit 1104 may configure the path selection device 1106 to alter the numberof time slots in individual time-slot bands. 

[0195] FIG. 25 shows an example output of the path selection device 1 106 of the master controller 1 102 in response 

to the receipt of a time-slot-altocation request. A first exemplary result record 2502 illustrates the selection of a path 

for a single time slot per calendar using a direct link {14, NULL}. Where the time-slot-allocation request required q time 
50 slots per calendar, with q = 4for example, the time-slot-allocation request may be considered to have been only partially 

satisfied. Other exemplary result records 2504 illustrate the selection of a two-link path for each of the remaining three 

time slots of the time-slot-allocation request. 

[0196] For each path in the example of FIG. 25, it is assumed that the path selection device 1106 found the vacant 
time slot as close to a reference time slot in the calendar as possible. In view of FIG. 23, it appears that the first result 
55 record 2502 is a result of path selection performed by the scheduler 231 2-2 associated with the second time-slot band 
and the result records 2504 are results of path selection perfomned by the head scheduler 2312-1 associated with the 
first time-slot band. 

[0197] The q requested time slots may be allocated any number, less than or equal to the number, q, of inter-modular 
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paths. The path for a given time slot may be described by one or more Inter-modular link identifiers, where each link 
identifier is equivalent to a label of an outbound port 1008 and the corresponding inbound port 1006. The entries in 
each row in the result records 2502, 2504 contain a first inter-modular link identifier, L1 , of a first inter-modular link, a 
second inter-modular link identifier L2 of a second inter-module link for a path of two Inter-module links, and a tlme- 

5 slot index, T (a number between 0 and S-1 , S being the number of time slots per calendar). The value of q is less than 
or equal to S and is typically substantially less than S. The identifiers 1.2 are initialized as null entries. It is noted that 
the second link identifier L2 can be determined, by an inlet switch module, based on L1 and the outlet switch module, 
however it may be provided by the path-selection device 1106. A null value is assigned to L2 if link LI connects the 
inlet switch module directly to the outlet switch module specified in a connection request. (An inlet switch module and 

10 an outlet switch module are the switch modules that respectively include the Inlet and outlet ports specified for a con- 
nection.) 

[0198] In a preferred embodiment, on first pass through the stages of the path selection device 1106 (FIG. 23), the 
schedulers 2312 attempt a second-order matching process (see FIG. 29). Any time-slot-al location requests rejected 
by all (i.e., not completely satisfied by any) of the schedulers 2312 are returned to the connection control circuit 1104 

IS (buffer 2228). The connection control circuit 1 104 may then re-introduce the rejected time-slot-allocation requests to 
the path selection device 1106 so that the schedulers 2312 may attempt a third-order matching process (see FIG. 30). 
Such a two-pass process helps to ensure that if a time slot is available wherein a direct connection between switch 
modules 1004 may be established, that time slot is allocated before considering a path via an intennediate switch 
module 1004. The two-pass process requires the secondary buffer 2228 to store time-slot-allocation requests that 

20 were not fully accommodated by a second-order timeslot matching. The secondary buffer 2228 may be part of, or 
controlled by, the connection-control circuit 1104 as described earlier. 

[0199] A connection request basically specifies an inlet port and an outlet port of the modular switch and a required 
number of time slots per time frame (per calendar). The switch-module controller must then identify the corresponding 
inlet switch module and outlet switch module. It is noted that the inlet switch module is not necessarily the switch 

25 module that receives the connection request. Referring to FIG. 26, an inlet port identification array 2610 having P 
entries, and an outlet port identification array 2620 having Q entries, where P is the number of inlet ports and Q is the 
number of outlet ports, can be used for associating a switch module with an inlet port or an outlet port. The inlet port 
identification array 2610 is indexed by an inlet-port identifier to read an identifier 2612 of the inlet switch module. 
Likewise, the outlet port identification array 2620 is indexed by an outlet-port identifier to read an Identifier 2622 of the 

30 outlet switch module. A inner link identification matrix 2630 having N rows and N columns holds the identifiers of the 
Inner-links, N being the number of switch modules 1004. An entry corresponding to row m and column n contains the 
Identifier of the inner link connecting switch modules m and n , 0 < m < N, and 0 < m < N. When m = n, the identifier 
is a NULL value indicated by the symbol "x". 

[0200] The arrays 2610, 2620, and matrix 2630 of FIG. 26 contain virtually static data that change only when the 
35 modular-switch configuration is changed. To allocate a path, requiring one or more time slots per time frame, three 
state maps are needed as Illustrated In FIG. 27. An inlet port state map 2710 has a number of rows equal to the number 
P of inlet ports and a number of columns equal to the number of time slots per time frame, eight in this example. An 
outlet port state map 2720 has a number of rows equal to the number Q of outlet ports and a number of columns equal 
to the number of time slots per time frame. An inner link state map 2730 has a number of rows equal to the number of 
40 inner links and a number of columns equal to the number of time slots per time frame. The maximum number of inner 
links Is N X (N-1 ). With N = 5, the number of inner links would be 20. In this example, however, the number of rows of 
the inner link State map 2730 is 25 (numbered 0 to 24). The'number of rows exceeds the actual number of inner links 
because the null links indicated in the diagonal of the inner link identification matrix 2630 are included for addressing 
convenience. 

45 [0201 ] A data structure comprising the arrays 261 0, 2620 and matrix 2630 of FIG. 26 and the state maps 271 0, 2720 
and 2730 of FIG. 27 can be used to find a path from any inlet port to any outlet port. If the inlet port identification array 
2610 and the outlet port identification array 2620 indicate that the Inlet port and the outlet port belong to the same 
switch module, only the inlet port state map 271 0 and the outlet port state map 2720 need be consulted. For example, 
Inlet port 12 and outlet port 14 belong to switch module 4, and hence only row 12 in the inlet port state map 2710 and 

so row 14 in the outlet port state map 2720 need be examined. This requires examining corresponding time slots in the 
two rows until either the required number of matching time slots is reached or ail time slots have been examined. If 
the inlet port identification array 2610 and the outlet port identification an^ay 2620 indicate that the inlet port and outlet 
port belong to different switch modules, then the inner link identification matrix 2630 is consulted to determine the inner 
link connecting the different switch modules. For example, inlet port 12 and outlet port 12 belong to switch modules 4 

55 and 3 respectively. Note that the inlet and outlet connectivity need not be identical as indicated in the inlet port iden- 
tification arrays 2610 and the outlet port identification array 2620. The inner link identification matrix 2630 indicates 
that link 23 connects inlet switch module 4 to outlet switch module 3. To find a path, row 12 in the inlet port state map 
2710 and row 12 in the outlet port state map 2720 are examined to find matching time slots. As described eartler, a 
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matching time slot is a time slot that is vacant in two or nnore specified ports or links. In order to realize temporal packing, 
as described earlier with reference to FiG. 9, the search for a matching time slot starts from the same reference time 
slot, conveniently selected to be time slot 0 for example, may be considered. When the first matching time slot is found, 
according to the inlet port state map 2710 and the outlet port state map 2720, row 23 in the inner link state map 2730 

5 is examined to determine if the matching time slot detemilned from rows 12 and 12 of the inlet port state map 2710 
and the outlet port state map 2720 is also vacant in row 23 in the inner link state map 2730. If so, the time slot is marked 
as allocable in row 1 2 of the inlet port state map 2710, row 1 2 in the outlet port state map 2720, and row 23 in the inner 
link state map 2730. The process continues until either the number of required matching time slots is met or all the 
time slots have been examined and there is still a pending number, greater than zero, of time slots. If there is a pending 

10 number of time slots, a third-order matching process would be required. A third-order matching process can use up to 
(N-2) paths from the Inlet switch module to the outlet switch module, one path through each swltchnnodule other than 
the inlet and outlet switch modules. 

[0202] Starting with a switch module 2, for example, to establish a path between inlet switch module 4 and outlet 
switch module 3, the inner link identification matrix 2630 is consulted to detenrtine the identifier of the outbound link 

IS from switch module 4 to switch module 2 and the inbound link from switch module 2 to switch module 3. These are 
detemnined as inner link 22 (outbound) and inner link 13 (outbound). This can also be seen from FIGS. 19A and 198. 
If a matching time slot detemnined from row 12 of the inlet port state map 2710 and row 12 of the outlet port state map 
2720 is also free in row 22 and row 13 of the inner link state map 2730, the time slot is allocable and corresponding 
entries in row 1 2 of the inlet port state map 271 0, row 1 2 in the outlet port state map 2720, and rows 22 and 1 3 in the 

so inner link state map 2730. If a matching time slot is found through switch module 2, no other switch module is attempted 
because at most one path can be established during a single time slot. If, on the other hand, a path through switch 
module 2 is not found, another switch module, 0 for example, is considered. The outbound and inbound links are 
determined from the inner link identification matrix 2630 to be inner link 20 (outbound) and inner link 3 (inbound), and 
the process of third-order matching is repeated using row 12 of the inlet port state map 2710, row 12 of the outlet port 

25 state map 2720, and rows 20 and 3 of the Inner link state map 2730. If ail (N-2) path are attempted, successfully or 
otherwise, and if the pending number of allocable time slots is still greater than zero, and if at least one time slot has 
not been examined, then the process is repeated with a new time slot. 

[0203] It is important to note that the outbound links, hence the associated inbound link, for the third-order matching 
processes are preferably selected In a manner that promotes vacancy equalization, as will be described later with 
30 reference to FIG. 29, 

[0204] The path allocation processes can be further simplified if the port numbering scheme described with reference 
to FIG. 19 is adopted. With that scheme, the inlet switch module and outlet switch modules are indicated directly as 
the second of two concatenated words. In addition, since for each time slot in a third-order matching process (N-2) 
candidate paths may be examined, it is convenient to place their state indicators (a state indicator is either 0 or 1) in 

35 consecutive locations in memory for both the outbound inner link and the inbound inner link of a path. This suggests 
the use of separate memory devices to contain the states of outbound links and inbound links, leading of course to 
data-storage duplication but faster processing. The preferable structure of the state maps is illustrated in FIG. 28. The 
structure includes inlet port-state map 2411 , analogous to the inlet port state map 2710, outlet port-state map 2412, 
analogous to the outlet port state map 2720, state map 2413 structured In S x N rows, where S is the number of time 

40 slots per time frame (per calendar) and N is the number of switch modules, and state map 2414 of a similar structure 
to that of state map 2413. Each row in state map 2413 contains the states of N consecutive outbound links, including 
a null entry corresponding to a non-existing link from a switch module to itself. Each row in state map 2413 contains 
the states of N consecutive inbound links, including a null entry. The states of all inner links, in state maps 2413 and 
2414 during a time slot occupy N consecutive rows, as indicated in FIG. 28. 

45 [0205] For each scheduler 231 2, the state maps held in memory devices 241 1 , 241 2, 241 3, and 241 4 take the form 
of matrices labeled A, B, C, and D, illustrated in FIG. 28. Each of the four matrices labeled A, B, C, and D includes a 
partition corresponding to each switch module. Each partition has a number of rows equal to a prescribed upper bound 
of a number of ports, in the input side or output side, of a switch module. The upper bound may further be selected to 
be the nearest integer that is a power of 2, and not less than the maximum number of ports per switch module, if 

50 addressing the matrices is based on concatenated numbers as described earlier. Only the used rows, however, are 
indicated in FIG. 28. Each of matrices A and 8 has a number of columns equal to the number of time slots in the time- 
slot band designated for the scheduler. Each of matrices C and D has a number of columns equal to above mentioned 
prescribed upper bound of a number of ports per switch module. 

[0206] FIG. 29 outlines the steps of a process followed by one of the selectors 2312 in the path selection deyice 
55 1106 to find a number of matching time slots. When the scheduler 2312 receives a request (step 2910) either from the 
previous scheduler 2312 or, in the case of the head scheduler 231 2-1 , from 1 :2 distributor 2304 (FIG. 23), the scheduler 
extracts the connection parameters: j.m, k.n, q, and type, as described above. The time-slot t is set to zero (step 291 2). 
The inlet state (0 or 1), stored in A(j.m, t) and the outlet state (0 or 1), stored in B(k.n. t). are examined (step 2920). If 
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either is busy (state 1), a succeeding time slot within the time-slot set covered by the scheduler 2312 is sought (step 
2970). If step 2920 is successful (i.e., if both the inlet port and the outlet port are free (state 0)), it is then required to 
determine whether the connection is an intra-module connection by comparing m and n (step 2922). If m = n, a result 
record {NULL, t), where the first field indicates a link identifier and the second field indicates a relative time-slot identifier 

5 t. is generated (step 2932). The number, q, of pending time slots is reduced by 1 (step 2934) and corresponding entries 
in state matrices A and B are updated to indicate a busy state (step 2960). the result record is written in result buffer 
(step 2962) and, if q is still greater than zero (step 2964), a subsequent time slot is sought (step 2970). If, in step 2922, 
m and n are detennined to be unequal, and the type of the request is determined to be other than type 2 (step 2924), 
a free time slot along the link from module m to module n is sought (step 2930). Note that, if such a direct link did not 

10 exist, the request would have been placed in buffer 2226 (FIG. 22) corresponding to a type-2 connection. To find an 
allocable time slot in a direct link, the value of C(m, n, t) is examined (step 2930) and if found to be zero (indicating a 
free time slot along link m.n), a result record (NULL, t} is generated (step 2932), the number q of pending time slots is 
then reduced by one (step 2934), corresponding entries in matrices A, B, and C are updated to indicate a busy state 
(step 2960), and the result record {NULL, t) is placed in the result buffer associated with the scheduler 2312 (step 

15 2962). If q is still greater than zero (step b, the remainder of the sub-frame is examined starting with step 2970). If the 
sub-frame is exhausted (step 2974), and q is still greater than zero (as determined from step 2964), the connection 
request is placed in a cascade buffer (step 2980) which is the connection -request buffer2402 of a subsequent scheduler 
2312, and the process is complete, if the scheduler in question is the h*^^ scheduler, the subsequent scheduler has an 
index equal to h+1 . If h = H, the subsequent scheduler is the head scheduler 2312-1 and the cascade buffer of scheduler 

20 H is the fifth input buffer 2228 of the connection-control circuit 1 104. 

[0207] If, in step 2924, the connection type is determined to be 2, then a third-order matching process is activated 
(step 2940). If a path is found for the current time slot t (step 2950), a result record {L, t}, L being the selected outbound 
link from switch module m, is formulated, the number q of pending time slots is reduced by 1 (step 2934), corresponding 
entries in state matrices A, B, C, and D are updated (step 2960)^ and the result record is written in the result buffer 

25 2314 associated with the scheduler 231 2 (step 2962). Note that the matrix D will be discussed in conjunction with FIG. 
30. If the pending number q of time slots is not zero (step 2964). a subsequent time slot is sought (step 2970) and, if 
the subsequent time slot is within the sub-frame of the current scheduler 2312 (step 2974), steps 2920, 2922, 2924, 
2940, and 2950 are repeated. If step 2980 is reached, the connection-request parameters, with the current value of q, 
are written in the connection-request buffer 2402 buffer associated with the subsequent scheduler 2312-(h-i>1). and 

30 the process is complete. 

[0208] Details of step 2940 are given in FIG. 30. In step 3020, a number, £2, of the remaining outbound links Is 
initialized to equal the number, of provisioned outbound links for switch module m. In step 3022, a is reduced by 
1 , and in step 3022, a current outbound link from switch module m is selected by adding 1 , modulo <t>^, to the index 
of the last used outbound link. In step 3040, entries C(m, L, t) and D(n, L. t) are examined to detennine the state of a 

35 two-link path from switch module m to switch module n through outbound link L during time slot t. If the path is free, 
step 2940 is complete and the selected value of L is used by step 2952 in formulating a result record. Othenivise, if in 
step 3040 it is determined that a path is not available, and provided that Q is not zero, indicating that there Is at least 
one more outbound link to be examined, steps 3022, 3024, and 3040 are repeated, and if Step 3050 is reached, a 
NULL outbound link identifier is returned to step 2950 to indicate that a path was not found along any of the outbound 

40 links <t>n, of switch module m. Where it is determined that a path was not found (step 2950), then a subsequent time 
slot (step 2970) is examined. It is noted that one of the outbound links can be a direct link, a path through which 
requires only a second-order time-slot matching process. However, such a link will always indicate a busy. state (C(m, 
L, t) = 1 ) because step 2940 is only activated when a second-order matching process fails in step 2930. 
[0209] In perfonning step 3040. row mxt of matrix C and row nxt of matrix D are preferably read concurrently from 

45 the link-state memory and the reverse link-state memory, respectively and held in registers. It is noted that, in step 
2940, only one outbound link L, 0 < L < <t>^ -1 , can be selected during a single time slot t. In a full-mesh, = N-2 for 
alt switch modules, where N is the number of switch modules. It is desirable to equalize the vacancy among all links 
and, therefore, the initial value of L for successive searches is preferably selected in a cyclic fashion where the initial 
value of L succeeds the value at which the immediately preceding search has ended. 

50 [0210] If, after exhausting all path for each time slot covered by the scheduler 2312, q is still greater than zero, the 
request parameters j.m, k.n, q, and type are placed in the connection-request buffer of the scheduler 2312. If the 
connection type is 2 and the scheduler 2312 is the tail scheduler 2312-H, the process is considered complete and, 
regardless of the value of q (0 or positive), the acquired number of allocable time slots are reported to the controller 
of the switch module that requested the connection. The switch-module controller can assign the time slots to the 

55 specific connection request for which the allocable time slots were sought, or the time slots may be assigned to another 
connection request, for example when q > 0 after processing a given request and a subsequent request can use the 
allocable time slots. 

[021 1] The memory that would be required to maintain an indication of the state of every port of every switch module 
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1004 in the modular, high-capacity, optical core node 1100 is divided into a number of memory partitions equal to the 
number of stages in the path selection device 1106 of FIG. 23. FIG. 28 illustrates, in further detail, memory devices 
2408 associated with a single scheduler 2312 (see FIG, 24). The memory devices 2408 include the inlet port-state 
memory 2411, the outlet port-state memory 2412, the outbound port-state memory 2413 and the inbound port-state 
5 memory 2414. 

[0212] As is indicated in the inlet port-state memory 2411, time slots 1 . 4, 5 and 7 are free for the inlet port 1016 
labeled 7 (see FIG. 20A) of the second switch module 1004(1). Additionally, as is indicated in the outlet port-state 
memory 241 2, time slots 1 , 3 and 7 are free for the outlet port 1 01 8 labeled 21 (see FIG. 20B) of the fifth switch module 
1004(4). As the inlet port 1016 labeled 7 and the outlet port 1018 labeled 21 are not part of the same switch module 

10 1004, a compound matching process is required. It should first be noted that the inlet and outlet ports 1016, 1018 of 
interest have free time slots 1 and 7 in common. The outbound port-state map of outbound port-state memory 2413 
and the inbound port-state map of inbound port-state memory 2414 may then be consulted to select a path from the 
second switch module 1004(1) to the fifth switch module 1004(4). Specifically, the state maps relating to time slots 1 
and 7 contained in these memories are reviewed. 

15 [0213] A review of the state maps relating to time slot 1 reveals that a direct inter-modular link is available from the 
outbound port 1008 labeled 9 of the second switch module 1004(1 ) to the inbound port 1006 labeled 9 of the fifth switch 
module 1004(4). A result record arising from this path selection would read {9, NULL, 1}. 

[0214] If the path selected in time slot 1 had not been available, or if a second time slot is requested by the time- 
slot-allocation request of interest, the state maps relating to time slot 7 may be reviewed. As indicated, an inter-modular 
20 link is available from the outbound port 1008 labeled 8 of the second switch module 1004(1) to the inbound port 1006 
labeled 8 of the fourth switch module 1004(3). Additionally, an Inter-modular link is available from the outbound port 
1008 labeled 19 of the fourth switch module 1004(3) to the inbound port 1006 labeled 19 of the fifth switch module 
1004(4). A result record arising from this path selection would read {8, 19, 7). 

[0215] A switch module can be implemented as a space switch which may switch optical signals from any input port 
25 to any output port. Recall that an input port can be an inlet port or an inbound port, and an output port can be an outlet 
port or an outbound port, as defined earlier. An optical signal may occupy a single wavelength channel or several 
wavelength channels, if all optical signals at input are in the same wavelength band, then any input signal can be 
switched freely to any output port. A switch module may also be implemented as a star-coupler where the input signals 
must have non-overlapping wavelength channels. Each input port must then be provided with a wavelength converter 
30 to shift the wavelength band of the optical signal received by the input port to correspond to a desired wavelength 
channel at an output port. (A wavelength channel carries a modulated wavelength which occupies a wavelength band). 

Balancing the processing loads of the schedulers 

35 [0216] Each scheduler 2312 attempts to allocate time slots for connection requests waiting at the corresponding 
connection-request request buffer 2402, where the connection-request buffer of the head scheduler includes the four 
buffers 2222, 2224, 2226, and 2228. Each of the H schedulers is provided with state maps covering a prescribed set 
of time slots of the slotted time frame (i.e., a prescribed set of cells in a calendar). The H sets of time slots assigned 
to the H schedulers are nonintersecting. The time slots within each set need not occupy consecutive positions in the 
40 slotted time frame. Each scheduler assigns consecutive numbers to its assigned time slots and associates each of the 
numbers with a corresponding actual position of the time slot in the slotted time frame. A scheduler reports the actual 
time-slot positions of allocated time slots. The head scheduler receives fresh connection requests from the N switch- 
module controllers in addition to continuation connection requests from the tail scheduler after attempting to allocate 
a required number of time slots through a direct link from an inlet switch module to an outlet switch module, using a 
45 second-order time-slot matching process. A scheduler dequeues connection requests waiting in its corresponding 
connection -request buffer 2402 and attempts to schedule them within its assigned set of time slots. A scheduler proc- 
esses one request at a time. The workload of the schedulers may vary appreciably, and the throughput of the assembly 
of schedulers, i.e., the mean number of scheduled time slots, is influenced by overloaded schedulers. The highest 
throughput is realized when the schedulers' workloads are equalized. The workload of a scheduler can be adapted by 
50 modifying its assigned subset of time slots. The occupancy of a connection-request buffer 2402 corresponding to a 
particular scheduler 2312 is an indicator of its workload. To balance the occupancies of the connection-request buffers 
2402 of the H schedulers 2312, the master controller 1102 (FIG. 11) can monitor the occupancy of each of the H 
connection-request buffers and determine adjustments (increments or decrements) of the sizes of the time-slot subsets 
assigned to some of the H schedulers 231 2. These adjustments are preferably adaptive, possibly changing with chang- 
es ing traffic composition. 
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Multiple connections 

[0217] A connection request received by a switch -module controller may specify an inlet port, an outlet port, and a 
number of time slots per time frame. A connection request may also specify a fonward connection from an inlet port to 

5 an outlet port and a return connection from the outlet port to the input port, with different numbers of time slots per time 
frame in the forward and return connections. To enable the creation of virtual networks within a given network, a con- 
nection request may further specify multiple pairs of inlet ports and outlet ports with different time-slot allocation for 
each pair. Such a multiple-connection request would be Initiated by an edge node that is managing a virtual network, 
in which case the controller of the switch module receiving the request would interpret the request and translate its 

10 connectivity requirement into a list of inlet-outlet-port pairs. 

Nodal Capacity 

[0218] With each port operating at the same nominal bit rate, and with a sufficient internal expansion, the access 
15 capacity of the modular switch is determined by the nominal bit rate times the lesser of the number of inlet ports and 
the number of outlet ports. If each inlet port receives only one wavelength channel, then the capacity of the modular 
switch is determined as the number of ports times the capacity per channel. Higher capacities may then be realized 
by using parallel switch modules. 

20 Capacity of scheduler array 

[021 9] The rate at which a scheduler processes connection requests waiting at its connection request buffer depends 
heavily on the spatial distribution of the connections. An infra-module connection requests specifying inlet and outlet 
ports belonging to the same switch module requires a simple first-order time-slot matching process. An inter-module 

25 connection specifying inlet and outlet ports belonging to different switch modules may be allocated through a direct 
link using a second-order time-slot matching process, if the connection requests from each switch module specify an 
equal number of time slots to each of the other (N-1) switch modules, then the inter-module connections can be ac- 
commodated on direct links, and the processing-intensive third-order time-slot matching would not be required. If the 
connection requests from each inlet switch module specify a small number of outlet switch modules, resulting in direct- 

30 link overload, then a large proportion of inter-module connections must be accommodated through intermediate switch- 
ing modules, requiring third-order time-slot matching. 

[0220] As described eartier a third order time-slot matching process requires comparing four calendars per candidate 
route. There are (N-2) candidate routes and examining all candidate routes requires {N-2) x i'j, state inspection proc- 
esses, each executed in A time units, where is the size of time-slot subset assigned to scheduler h, 1 < h < H. For 

35 s = 1 ,024 time slots per time frame, and H = 1 6, and if the time-slot subsets are of equal size, then H^^, - 64. With N = 
32 switch modules, and A = 50 nanoseconds, the maximum duration of a third-order time-slot matching process within 
scheduler h would be about 100 mrcroseconds, and the mean value may be as high as 80 microseconds. Estimating 
the processing time of an intra-module connection within the switch module to be four microseconds and the processing 
time of an inter-module connection accommodated through a direct inner link to be eight microsecond, and considering 

40 a spatial distribution where intra-module requests constitute 0.2 of all requests and inter-module requests accommo- 
dated through direct links constitute 0.4 of all requests, then the weighted mean processing time per connection is (0.2 
X 4 + 0.4 X 8 + 0.4 X 80) = 36 microseconds, and the scheduler throughput would be 27,700 basic connections per 
second. A basic connection requires one time slot, and the processing effort of a connection requiring y time slots is 
approximated as y x a. Increasing the number of schedulers from 16 to 32 reduces the weighted mean processing 

45 time per connection to 1 8 microseconds and, hence, increases the throughput to 55,400 connections per second. 
[0221] It is noted that the occupancy of the modular switch decreases gradually along the time frame, with the highest 
occupancy expected at the reference time slot (time slot 0 for example). The sizes of the time-slot subsets assigned 
to the H schedulers may be adjusted to equalize the workload. 

so Hybrid modular switch 

[0222] A modular switch can be constructed using both electronic and optical switch modules. FIG. 31 illustrates a 
modular switch structured as a mesh of electronic switch modules 3120 and optical switch modules 3140. Each elec- 
tronic switch module 3120 preferably has a buffer 3122 at each inlet port and each inbound port (I.e., at each input 
55 port). A buffer is desirable at each inlet port to relax the requirement of time coordination with the source nodes. A 
buffer is needed at each inbound port in order to reduce the order of time-slot matching as illustrated in FIG. 32. In 
addition, if an outbound port of an electronic switch module 3120 connects to an inbound port of an optical switch 
module 3140, then it is also preferable that the outbound port be provided with a buffer 3124 to enable decoupling the 
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matching processes at the electronic and optical switch modules and, hence, further reduce the order of the matching 
process. If a switch module in the modular switch is an electronic switch, the switch module need not be time-locked 
to the source nodes to which it connects. Time-alignment can, instead, be realized using the buffer 3122 at each input 
port of the switch module 3120. 

5 [0223] FIG. 32 indicates the matching processes required in an internal path traversing two or three switch modules, 
where a switch module can be electronic or photonic. The highest order of matching processes is detemiined by the 
number of consecutive optical switching modules in the path. If a path traverses three switch modules where the middle 
switch module in the path is electronic^ then three independent first-order matching processes are required. If only the 
first or the third switching module is an electronic switch, then two independent matching processes, are required, one 

10 of which being a second-order matching process and the other a first-order matching process. It is preferable to start 
with the higher-order matching process in the search for matching time slots. 

All-Electronic modular switch 

15 [0224] A core node 1200 (FIG. 12) may comprise only electronic switch modules. To realize a modular switch of 
moderate capacity, of the order of 1 0 Terabits per second for example, common-memory switch modules may be used, 
each having 64 input ports and 64 output port. The 64 input ports would be divided into 24 inlet ports and 40 inbound 
ports, and the 64 output ports would likewise be divided into 24 outlet ports and 40 outbound ports. The 40:24 expansion 
is provided to offset the effect of spatial traffic imbalance which forces some inlet-outlet traffic streams to use two-link 

20 internal paths. The mesh structure of FIG. 4 or FIG. 12 would then comprise a maximum of 41 switch modules and 
the total number of input ports or output ports is then 24 x 41 = 984. With each input or output port operating at 10 
Gb/s, the total capacity of the modular switch would be 9.84 Terabits per second. To realize a modular switch of very- 
high capacity, a rotator-based switch having 512 input ports and 512 output ports can be used as a switch module. 
With the 512 input ports divided into 192 inlet ports and 320 inbound ports, and the 512 output ports divided into 192 

25 outlet ports and 320 outbound ports, the maximum number of switch modules in a mesh structure (FIGS. 4 and 12) is 
321 . The maximum number of input ports or output ports in the modular switch becomes 1 92 x 321 = 61 ,632, and 
with each port operating at 10 Gb/s, the total capacity of the modular switch is about 616 Terabits per second. The 
rotator-based switch is described in United States patents 5,1 68,492 and 5,745,486 issued on December 1 , 1 992 and 
on April 28,1998 to Beshai et al. 

30 [0225] In summary, time-sharing switching can not be easily implemented in a network in which an optical signal has 
to traverse two or more optical switches. However, a set of intemiediate-sized optica! switches may be used in con- 
junction with a multi-stage configuration. Although path finding in a time-shared, multi-stage, bufferiess, optical core 
node can be arduous, the present invention provides hardware for a relatively fast performance of the required extensive 
processing. The overall efficiency of a network based on high-capacity core nodes is enhanced by reducing the internal 

35 blocking of the core nodes, thus pemnitting high occupancy of the input links and output links of the core nodes, and 
by increasing the scheduling capacity of each core node, thus permitting fast redirection of connections. 
[0226] Other modifications will be apparent to those skilled in the art and, therefore, the invention is defined in the 
claims. 

40 

Claims 

1. A modular switch comprising; 

45 a plurality of switch modules, each of sard plurality of switch modules Including: 

a plurality of inlet ports, each of said plurality of inlet ports adapted to communicatively couple to an optical 
link from an external node; 

a plurality of outlet ports, each of said plurality of outlet ports adapted to communicatively couple to an 
optical link to an external node; 

a plurality of outbound ports, each of said plurality of outbound ports communicatively coupled to an inter- 
modular link to an other of said plurality of switch modules; 



55 



a plurality of inbound ports, each of said plurality of inbound ports communicatively coupled to an inter- 
modular link from an other of said plurality of switch modules; and 
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a master controller including a path selection device adapted to match vacant time slots in calendars associated 
with particular ones of said inlet ports, said outlet ports and selected outbound ports of said switch modules 
in response to receiving a connection request, said connection request specifying an inlet port, an outlet port, 
and a number of time slots in a time-division-multiplex frame. 

The modular switch of claim 1 further comprising 

a plurality of switch-module controllers, each of said plurality of switch-module controllers communicatively 
coupled to an associated one of said plurality of switch modules and to said master controller, each of said 
plurality of switch-module controllers operable to: 

receive said connection request from one of said external nodes via said associated one of said plurality 
of switch modules; 

send said connection request to said master controller; 

^ receive a schedule from said master controller, where said schedule is generated by said master controller 
responsive to receiving said connection request; and 

send, to said associated one of said plurality of switch modules, commands adapted to configure said 
associated one of said plurality of switch modules according to said schedule. 

The modular switch of claim 2 wherein each of said plurality of switch modules further comprises: 

a controller input port adapted to receive communication from said associated switch-module controller; and 

a controller output port adapted to transmit communication to said associated switch-module controller. 

The modular switch of claim 3 wherein each of said switch modules is an optical switch module and each of said 
switch-module controllers is further operable to exchange time-locking data with said external nodes via said as- 
sociated one of said plurality of switch modules, said time-locking data comprising readings of time counters pro- 
vided at said extemat nodes and said modular switch. 

The modular switch of claim 3 wherein at least one of said plurality of switch modules is an electronic switch module 
and each of said inlet ports and inbound ports associated with said electronic switch module includes a data buffer. 

The modular switch of claim 5 wherein at least one of said plurality of switch modules is an optical switch module 
and said electronic switch module has at least one outbound port that Is communicatively coupled to an Inter- 
modular link to said optical switch module, each of said at least one outbound port including a data buffer. 

A method of controlling an optical switch module in a modular optical switch, said method comprising: 

receiving a connection request, from an edge node, via said optical switch module, where satisfying said 
connection request requires use of an inter-modular link to another optteal switch module In said modular 

optical switch; 

sending a time-siot-allocation request to a master controller of said modular optical switch, where said time- 
slot-allocation request is based at least in part on said connection request; 

receiving a plurality of connection schedules, from said master controller in response to said sending said 
time-slot-allocation request, wherein each of said plurality of connection schedules is associated with a port 
of said optical switch module; and 

sending, to said optical switch module, commands adapted to configure an intemal connectivity of said optical 
switch module according to said connection schedules. 

The method of claim 7 further comprising receiving a release request specifying an inlet port, an outlet port, and 
identifiers of timeslots to be released. 
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9. The method of claim 8 further comprising receiving a release request specifying the release of all time-slots allo- 
cated to a path from an inlet port to an outlet port. 

10. The method of any of claims 7 to 9 further comprising sending each of said connection schedules to an appropriate 
one of said plurality of edge nodes. 

11. The method of any of claims 7 to 10 further comprising translating said connection request, where said connection 
request specifies a source edge node and a destination edge node, to a time-slot-allocation request, where said 
time-siot-allocation request specifies an inlet port, an outlet port, and a number of time slots per calendar period. 

12. A modular switch comprising: 

a plurality of switch modules, each of said plurality of switch modules including: 

a plurality of inlet port means for communicatively coupling to an optical link from an external node; 

a plurality of outlet port means for communicatively coupling to an optical link to an extemal node; 

a plurality of outbound port means for communicatively coupling to an inter-modular link to an other of 
said plurality of switch modules; 

a plurality of inbound port means for communicatively coupling to an Inter-modular link from an other of 
said plurality of switch modules; and 

a master controller including a means for matching vacant time slots in calendars associated with particular 
ones of said inlet port means, said outlet port means and selected outbound port means of said switch modules 
in response to receiving a connection request, said connection request specifying an inlet port means, an 
outlet port means, and a number of time slots in a time-division -multiplex frame. 
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